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The purpose of this paper is to survey the present status of our 
knowledge of the relative variability of mental traits and especially to 
examine critically the various generalizations and ‘“‘laws’’ that have 
been proposed by writers on this subject. 

The usually accepted method of computing relative variability is 
to use the formula devised by Karl Pearson for the Coefficient of Vari- 
ation (V), according to which V=100S.D./M. We divide the standard 
deviation (S.D.) of the distribution by the mean (M) of the distribu- 
tion to two decimal places and by multiplying by 100 we eliminate the 
decimal. This makes it possible to compare the variabilities of charac- 
teristics which have been measured in different units such as pounds, 
inches, seconds and the unit scores of some psychological tests. It also 
makes an allowance for differences in averages when measurements are 
in the same units. 

If we wish to apply the above formula to psychological test results 
we encounter in many cases a serious difficulty which is overlooked in 
some discussions of the subject. In order to use the V formula for most 
comparisons the measurements used must be arrived at by using scales 
with true zeroes and the scales must be in approximately equal units. 
It is especially important that V scores not be calculated for error scores 
or time scores which decrease as performance improves. In these 
cases zero scores would mean perfect performance rather than the 
absence of any ability. 

Unfortunately the zeroes of psychological tests are commonly not 
true zeroes of ability. As a result the mean is usually too small, so that 
when we divide S.D. by M, the resulting V is too large. If then we com- 
pare V’s on two tests which have zeroes at relatively different distances 
from their true zeroes the results cannot be interpreted accurately. 
This error is most serious when dealing with higher and more complex 
functions such as memory and reasoning. 
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While the Pearson method of computing relative variability has been 
generally used, other methods have had supporters. Thorndike (86, 
p. 9) suggested that the square root of M be used instead of M in the 
Pearson formula. This change increases relative variability in some 
cases where the ‘use of M gives decreases. Yule (109, p. 49) suggests 
that deviations be compared by determining their ratios to the geo- 
metric mean. Wechsler (98) after eliminating pathological extremes 
determines the ratios of the highest to the lowest measurements. This 
method seems to have been used because of Wechsler’s hypothesis that 
the ratio of the highest to the lowest measures was limited by the mathe- 
matical constant e (2.718..). This, however, does not seem to hold for 
some simple sensory functions. Woodrow (105) changes raw scores to 
scaled scores and compares S.D.’s to determine the effect of practice. 
Peters and Van Voorhis (67, pp. 78 ff.) throw the baby out with the 
bath water: they suggest that the zeroes of all distributions be placed 
about 3 sigmas below the mean where the scores begin to diverge. In 
normal distributions this would make all V’s equal to about 33 and 
hence useless for comparison. 

These differences in method are important because the conclusions 
reached may depend on the method used. However, since most students 
accept the Pearson formula we shall use it as a basis for further dis- 
cussion. 

HISTORICAL BACKGROUND 

The earliest suggestions of laws of variation were, quite naturally, 
made by biologists, but some of these have psychological implications 
and should be mentioned. 

In 1809 Lamarck (48) opened the question as to the natural causes 
of variations by advancing his well-known idea that organisms vary in 
response to environmental stimulation and in a way to meet environ- 
mental requirements. Characteristics become stronger through use or 
weaker through disuse, and these changes are transmitted to offspring. 

Fifty years later Darwin (12) included a chapter on ‘‘Laws of 
Variation”’ in The Origin of Species. He accepts the Lamarckian idea 
about use and disuse but places less emphasis on it and emphasizes 
variation from other ‘‘... causes of which we are quite ignorant.”’ 
Among his conclusions are these: rudimentary organs are more variable 
than ordinary organs; highly developed organs are unusually variable; 
specific characters are more variable than generic characters; and lower 
organisms are more variable than higher organisms. 

Havelock Ellis (22), in 1894, developed and emphasized the idea 
that men are more variable than women. The evidence for this claim 
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rested largely on physical characteristics. Three years later, Pearson 
(66) examined at some length the problem of sex differences in varia- 
bility in physical traits. He concluded that evidence was lacking for 
greater male variability. Pearson also concluded from the study of 
cephalic indices that civilized races are relatively more variable than 
primitive races. 

In 1902 Vernon (95) published a very important biological study of 
variation with incidental psychological references. This was based on a 
careful statistical treatment of measurements. After studying varia- 
tions in physical growth he formulated a tentative law as follows: 
“The variability of a developing organism diminishes regularly with its 
growth” (p. 206). Minot and Pearson, working indepeiidently, appar- 
ently did the most important work which furnished the basis of this 
law. Vernon agrees with Minot that this reduction in relative variabil- 
ity during growth probably occurs in all mammals. Psychologists 
naturally will ask: Is this true for mental traits? 

Vernon also discussed the possible influences of genetic selection 
and of natural selection on variability. In both cases, relative variabil- 
ity would be decreased. 

In 1895 Binet and Henri (3, p. 417) after reviewing studies made up 
to that time in the psychological field conclude: 

Among the results which emerge from all these studies we shall cite a few: 
The first, the most important of all, we believe, is that the higher and more 
complex a process is the more it varies from individual to individual: sensations 
vary from one person to another, but less than memory, memory of sensation 
varies less than memory of ideas, etc. It follows from this then that if one wishes 
to study the differences between two individuals it is necessary to begin with 
the highest and most complex processes and it is only secondarily that we need 
to consider the simple and elementary processes: this is, however, the opposite of 


what has been done by the great majority of authors who have treated this 
question. (Translated by the writer.) 


This statement was made of course before the statistical treatment 
of mental measurements had become common. It is probably based 
primarily on observations of qualitative differences. 

I have not seen the first edition of Stern’s Differentielle Psychologie 
which appeared in 1900 but in the third edition (83, pp. 257 ff.) he 
accepts the conclusion of Binet and Henri that complex traits are more 
variable than simple ones. To this he adds the idea that there is a 
positive correlation between the time of appearance (phylogenetically 
and ontogenetically) of a trait and its variability. Hence traits appear- 
ing in adolescence would be more complex and more variable than 
those appearing in infancy. He expresses the idea that relative varia- 
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bility should be determined by finding the percentage which a deviation 
is of the mean. 

A study made by F. L. Wells seems to have suggested to Stern the 
idea of comparing the variability of traits within an individual with the 
variability of a single trait in a group of individuals. He concludes 
that higher processes will vary more than lower processes in a single 
individual. Presumably this would mean, for example, that mathe- 
matical ability and linguistic ability would show more independent 
variability in the same person than would auditory and visual acuity. 

Thorndike (87, p. 317) formulates two tentative conclusions about 
relative variability as follows: 


1. The variations are, in general, greater in acquired than in original traits. 
2. They are, in general, greater in traits peculiar to man than in traits 
characteristic of all mammals. 


Studies of sex differences are reviewed for evidence of a difference in 
relative variability and Thorndike concludes that women are probably 
slightly less variable than men (pp. 193 ff.). Several studies made in 
the period from 1908 to 1914 on the effects of practice on individual 
differences are reviewed (pp. 304 ff.) but the results are inconclusive. 
However, he at least formulates the problem. 

H. L. Hollingworth (41, pp. 74-84) discusses the problem of rela- 
tive variability and concludes that variability is probably greater for 
traits that are more complex, more functional, more recent (phylo- 
genetically or ontogenetically), more specific, more symbolic, less used 
and less relevant. 

Hull (42) makes a study of the question of variations in traits within 
the individual. He calls this “trait variability’’ and reaches a tentative 
conclusion that such trait variability is about 80 per cent of the vari- 
ability of single traits in the population. 

Wechsler (98) raises the question as to the range of human capaci- 
ties. After eliminating the pathological extremes, he determines the 
ratio between the measurements of the highest and the lowest mem- 
bers of a group and draws the conclusion that with very few exceptions 
this ratio does not exceed 3.0:1 (p. 62). Wechsler emphasizes the point 
that while individual differences are real and are important they are 
not nearly as great as is commonly assumed. He believes this conclusion 
has an important bearing on the question of democratic government 
and on related social and economic problems. 

Wechsler also subscribes to the idea that complex traits are more 
variable “‘ . . . since the variability of any given phenomenon is neces- 
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sarily the function of the product of the variabilities of the individual 
factors which determine it’’ (p. 59). 

This brief review will suffice to indicate the chief points in the his- 
torical background of current ideas of relative variability. Before 
attempting to pass judgment on the accuracy of these and other gen- 
eralizations it will be useful to have a look at the problem of variation. 


THE MECHANISM OF VARIATION 
Most normal psychological characteristics show continuous varia- 
tion from the very low to the very high degrees of the trait. Geneticists 
TABLE I 


THE RELATIVE VARIABILITIES (V) Or DisTRIBUTIONS RESULTING FRoM CHANCE 
CoMBINATIONS Or Coins AnD DICE 








S.D. M V S.D. M V 








4 Coins 1 6 17 
9 Coins 1.5 13:3 11 
4 Dice 3.416 14 24 
9 Dice 
Theoretical § .123 31.5 16.3 
Empirical 5.073 32.3 15.7 





Heads and tails on the coins are counted as 1’s and 2’s respectively. Four coins falling 
three heads and one tail give a total of five. 

The S.D.* of the first N natural numbers, 6 in case of dice and 2 in case of coins as 
used above, equals (N?—1)/12 (Yule, 109, p. 143). When we add equipotent independ- 
ent variables the S.D.* varies as the number (m) of the variables. Hence the S.D. of the 
distribution resulting from tossing m dice or coins is given by the formula: S.D.? 
=n(N?—1)/12. For nine dice this would be: S.D.2=9(68—1)/12 =26.25. S.D.=5.123. 
As an empirical check, nine dice were tossed 200 times with results as given under 
“Empirical.” 


usually interpret this to mean that differences in the strength of in- 
herited characteristics are due to multiple factor inh<ritance: the 
strength of the characteristic is determined by the joint influence of 
a number of genes. This hereditary potential would be affected by 
environmental factors to a greater or lesser degree, so that the final 
result would depend on both kinds of factors. 

In order to get examples of variation under conditions where we 
understand clearly the material with which we are dealing, some re- 
sults, both theoretical and empirical, from certain chance combinations 
of coins and dice are presented in Table |. 
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When two factors combine to produce a more complex total the 
combination may be an additive process or it may involve multiplica- 
tion with the result a product of the elements. Psychologists usually 
accept the additive assumption. The addition of normal distributions, 
as in case of our dice problem, gives a normal distribution while prod- 
ucts give a skewed distribution. Since distributions of complex psycho- 
logical traits are commonly approximately normal this seems to justify 
the tentative use of the additive method. The difference between these 
two assumptions, as we shall see later, is vitally related to the problem 
as to whether simple or complex traits are relatively more variable. 


THE “‘LAaws”’ OF RELATIVE VARIABILITY 
Complexity vs. Simplicity 

If we examine Table I we find that, whether we are dealing with 
coins or dice, as we pass from the totals for four variables to nine vari- 
ables the S.D.’s increase and the V’s decrease. This happens because 
the M’s increase in proportion to the number of variables while the 
S.D.’s increase in proportion jo the square root of N. On this basis 
then an increase in complexity results in an increase in absolute vari- 
ability but a decrease in relative variability. 

If, however, we compare the totals for four coins and four dice we 
have the same number of variables in each case, but with an increase 
in the variability of the variables, and under these conditions, both the 
S.D.’s and the V’s increase. 

On this basis if we assume intelligence to be made up of capacities 
for perception, memory, reasoning and such other behavior as the 
reader wishes to add, the value of V for intelligence should be lower 
than the V’s of its components. In another population where intelli- 
gence has the same components but where the components themselves 
varied more widely in strength the relative variability of intelligence 
would be greater than in the first population. 

According to accepted statistical principles we may administer two 
tests to the same group of subjects and after computing the M’s and 
S.D.’s of the two tests separately we may determine the mean of the 
sum of the two tests simply by adding the means of the two separate 
tests, and we may obtain the S.D. of the summed distribution by using 
the formula for that purpose (109, p. 211). From this it follows that 
the value of V for the sum of any two tests can never exceed the higher 
of the two values of V for the two tests which were combined—provided 
the test scores were of such a character that it was legitimate to compute 
a V in the first place. In this sense it is a statistical impossibility to in- 
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crease the value of V by combining test scores. If then complex traits 
are considered to be the sum of simple traits, they must be relatively 
less variable. 
TABLE II 
COEFFICIENTS OF VARIATION (V) For DIFFERENT CHARACTERISTICS 








Trait 


Subjects 

















Weight of spleen (6) 54 normal males, 30-40 years old 45% 
Olfactory acuity (4, 34) Assorted adults 30+? 
“ Visual acuity (20, 47, 77) 805 17-20 yr. olds plus others 27+ 
Strength of back (36) Men 17-30 years old 228 
Simple visual reaction time (29, 35) 5564 English adults plus others 2114 
Strength of grip (106) 609 16 year old boys 21% 
Weight of liver (33) 73 English adults 208 
Auditory acuity (68) 61 young adults 19+ 
Vital capacity (106) 599 16 year old boys 19% 
Card sorting (72) 148 16 year old girls 17 
Visual perception span and speed (58) 98 males, 18-29 years old 155 
Memory span, digits (98) 236 male adults 15§ 
: Tapping (106) 615 boys, 16 years old 134 
Highest audible pitch (47) 805 17-20 year olds 12 
Body weight (43) U. S. soldiers in 1917 12 
Weight of cerebrum (65) 308 English adult males 11 
Py Brain weight (65) 416 English adult males 9 
4 Cranial capacity (51, 65) English adult males 8 
be Knee height (43) U. S. soldiers in 1917 8 
5 Leg length (43) U. S. soldiers in 1917 7 
Be Stature (43) U. S. soldiers in 1917 4 
i CAVD intelligence (88) Assorted Ka 
Body temperature (102) 601 convicts 0.55 








1M, S.D. and V calculated by the writer. 

*'V estimated by the writer from percentage distribution. 
*'V calculated by the writer from percentile distribution. 
‘ Based on reciprocals of time scores. 

*'V calculated by the writer. 


If we turn from statistical theory to empirical data to test our con- 
clusion we confront the fact that most intelligence tests do not have 
true zeroes. Hence we cannot compute a true value of V for them. 
An exception is found in the CAVD test worked up by Thorndike and 
others (88). The mean adult score on this test is given as 36.5 and the 
S.D. is 1. This gives a V of 2.74. By examining Table II it will be seen 
that this is comparatively a very low value and agrees fully with 
theoretical expectations. 
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Even if we make a very large allowance for error in Thorndike’s 
determination of the zero of the CAVD scale and multiply the value of 
V by 3 it would be only 8.2, which is relatively a low value. 

Thurstone (89) also has attempted to determine the true zero for 
mental tests. His method is based on the assumption that relative 
variability of intelligence remains constant during growth. On a priori 
grounds the writer questions this assumption because all of the results 
he has seen for physical, psychomotor and sensory traits measured by 
objective scales with true zeroes show variations in V’s with age. Under 
these conditions it would be very surprising if the V’s for intelligence 
did not change with age. In the second place Thurstone’s results are 
not consistent. One application of his method to Binet Test results 
(89, p. 196) gives a V of 7.2 (M =13.9, S.D. =1) while another applica- 
tion (90, p. 574) gives a V of 37.6 (M =2.66, S.D.=1). This difference 
can be only partly accounted for by the difference in the variability of 
the two groups tested. The growth curve in the first instance is a con- 
ventional negatively accelerated growth curve similar to those usually 
found. In the second instance the curve is first positively accelerated 
and later is negatively accelerated. These disagreements would seem 
to afford ample basis for questioning the validity of Thurstone’s 
method. 

Ability as represented by a total score is certainly more complex 
than the abilities represented by the separate test scores. Yet the V’s 
for total scores, as noted above, are regularly lower than the V’s of at 
least some of the individual tests. An example of this may be had from 
the VACO test results reported by Freeman and Flory (25, p. 42) for 
children aged 13 years. They report V’s as follows: vocabulary 20.3, 
analogies 23.9, completion 20.3, opposites 27.9, and total 18.5. 

If we consult Table II we find that with the exception of the weight 
of the spleen the highest V’s are for olfactory and visual acuity, strength 
of back and of grip, and reaction time. These are hardly what psycholo- 
gists would usually call the most complex traits. Visual perception span 
and memory span for digits are near the middle of the list. 

Additional evidence that simple functions are relatively more vari- 
able than complex functions is provided by some studies not included 
in Table II. ;' 


Leukhart (49) gives the times for monocular accommodation for 
14 subjects. Since higher time scores mean slower and poorer accom- 
modation I have calculated the value of V for the recipfocals of these 
time scores and find it to be 40.6. Warden, Brown and Ross (96) de- 
termined the threshold for motion acuity for 28 subjects at scotopic 
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levels of illumination. This is stated in terms of angles and the Jowest 
scores indicate the greatest visual acuity. For this reason I have again 
determined the V based on reciprocals and find it to be 147. Since one 
extreme case is responsible for a large part of this value, I determined 
V for the remaining 27 cases and found it to be 74—still a very high 
value. In both of the above studies the number of cases is rather small 
and the sampling is limited to groups of generally superior young adults. 
However, when we find wide variability under such conditions it ap- 
pears improbable that we should not find it with a larger and less 
selected sample of the population. 

Hermans (38) had 100 subjects observe the size of a standard. 100 
mm. aperture with binocular, with monocular and with pinhole vision 
and attempt to match this by adjusting a different aperture with 
binocular vision. Table III gives the results. M’s and S.D.’s are from 
Hermans, V’s have been added by the writer. Relative variability in 
responses is clearly greater with the simpler pinhole vision and less with 
the higher and more complex binocular vision. 


TABLE III 


RELATIVE VARIABILITIES OF JUDGMENTS OF S1zE AT THREE LEVELS OF 
Vision (HERMANS, 38) 





M S.D. V 








Pinhole vision 67.25 23.05 34 
Monocular vision 93.70 14.30 15 
Binocular vision 104.74 6.54 6 








McGeoch (52) used three groups of nonsense syllables of different 
levels of association value, as determined by Glaze, and added a group 
of three-letter words. The 00% syllables are those for which subjects 
reported no associations, the 53% syllables are those for which 53 per 
cent of the subjects reported associations, and so on. McGeoch de- 
termined the amounts of these that were learned in a given period of 
time. Table IV gives the results. M’s and S.D.’s are from McGeoch. 


TABLE IV 


THE RELATION OF RELATIVE VARIABILITIES TO THE ASSOCIATIVE VALUE OF 
SYLLABLES (McGeEocnH, 52) 

















Material Ae 3 apt S.D. V 
3-letter words 9.11 1.12 12 
100% syllables 7.35 1.96 27 

53% syllables 6.41 2.37 37 
00% syllables 5.0 2.60 
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V’s are added by the writer. This table shows that as material be- 
comes more meaningful, and hence presumably involves more complex 
learning functions, there is relatively less variability in the amount 
learned. 

While the zeroes of these scores may not be true zeroes, that can 
hardly account for the obtained differences. Unfortunately it is not 
possible to extend this type of analysis to a study of higher levels of 
rational memory because of the fact that nothing approaching a suitable 
measuring instrument is available. 

In the psychological field the evidence certainly does not support 
the generalization that complex traits are relatively more variable— 
rather the opposite. This conclusion is further supported by considering 
anatomical measurements. 


Some of the internal organs such as the spleen and the liver are 
relatively the most variable. If one questions these results because of 
admitted difficulties in securing a satisfactory selection of human bodies 
to dissect, it may be noted that the same relation is found for laboratory 
animals. Thus Donaldson (16, p. 225) reports V’s for weights of the 
rat as follows: brain 10, body 19, heart 24, liver 25, thymus 34, and 
ovaries 43. As in man, body weights show less variability than is 
shown by some of its parts. The brain, which is hardly a “simple” 
structure, is relatively much less variable than the internal organs such 
as the spleen and the liver. Likewise, in Table II, consider the series— 
stature, leg length, and knee height. Stature is more complex and rela- 
tively less variable. Similarly the cerebrum is relatively more variable 
than the whole brain and the latter is relatively more variable than the 
brain case. 

In contrast we may note that body weight is relatively more vari- 
able than stature. As a matter of geometry the body is a solid and has 
three dimensions. Similar solids vary as the cubes of their single 
dimensions—and cubes are products of three dimensions. In our dice 
and coin illustration we worked with sums and the results seemed to 
agree with the psychological test results. If, however, we deal with 
products the result is different and then Wechsler, as quoted above, 
would be right when he says that complex characteristics are relatively 
more variable ‘‘since the variability of any given phenomenon is neces- 
sarily the function of the product of the variabilities of the different 
factors which determine it”’ (98, p. 59). 


While this generalization is true when we compare stature and 
weight it does not appear to be true when we compare psychological 
characteristics. Psychological test results agree better with the addi- 
tive assumption usually made, and, as I shall point out later, the as- 
sumption of subtraction is helpful in understanding certain changes in 
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relative variability associated with old age, fatigue and forgetting. 
Certainly there is no good psychological, biological or mathematical 
reason why we must assume that psychological variation is always 
and “necessarily”’ a function of products rather than of sums or dif- 
ferences. 

Functions vs. Structures 


Hollingworth’s second law states that functions are more variable 
than structures. From Table II we may see that the lowest V’s are for 
functions while the highest is for a structure. With advancing age rela- 
tive variability of visual acuity climbs rapidly and is above 90 by the 
age of 60 (77, p. 85). It might be difficult to match this with an example 
of structural variability. But in young adults and in children several 
structures are relatively more variable than visual acuity. Brain weight 
is relatively more variable than CAVD intelligence. A very wide 
range of V’s can be supplied for both structures and functions, hence it 
would appear that this ‘‘law’’ hardly agrees with the facts. 


Symbolic vs. Concrete 


Since CAVD intelligence is distinctly symbolic and since the ability 
to handle symbols is presumably complex it hardly seems necessary to | 
discuss this ‘‘law’’ separately. 


Recent vs. Ancient 


Stern, Thorndike and Hollingworth have supported the idea that 
traits recently acquired—whether phylogenetically or ontogenetically— 
are more variable than those of more ancient vintage. This seems to be 
in conflict with the principle supported by Vernon that the variability 
of a developing organism decreases with growth. Also it appears to be 
in conflict with the fundamental biological theory of evolution. Thus 
Shull (81, p. 243) writes as follows (italics are my own): 

The similarity of the species of a genus is held to indicate kinship, but since 
there is greater diversity among the individuals of a genus than among the members 
of a species, the common stock from which the species of a genus have sprung 


must have existed at an earlier time, in order that evolution could bring about 
the degree of divergence now observed. 


Reptiles and lower animals are ‘‘cold-blooded’’ because they lack 
our temperature regulating mechanism. Yet our body temperatures 
show much less relative variability than various “‘older’’ functions. 
Aiso body temperature is less variable in adults than in children. The 
higher levels of CAVD intelligence are late in appearance both in the 
race and in the individual and show relatively small variation. The 
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sense of smell was originally the dominant distance receptor and the 
cerebrum first began to develop as an olfactory correlation center. Yet 
at the present time and in young human adults olfactory functions 
appear to be relatively more rather than less variable than visual and 
auditory functions. Auditory acuity develops phylogenetically after 
visual acuity and seems to be relatively less variable. Musical talent 
can hardly be said to appear early in the race but it does appear early in 
childhood and it seems to be more variable than abstract intelligence. 

This generalization that traits more recently acquired—whether 
individually or racially—are more variable than older traits is probably 
a deduction from the supposedly greater variability of complex func- 
tions. It appears to be equally erroneous. 


Specific vs. Generic 


Darwin says: “‘It is notorious that specific characters are more vari- 
able than generic.”” And “... the points in which all the species of a 
genus resemble each other, and in which they differ from allied genera, 
are called generic characters.’’ But since the nearest relatives of Homo 
sapiens are known only through fossil remains and since biologists are 
not agreed (Dobzhansky, 14) on the classification of these fossils into 
genera and species it seems unwise to attempt to discuss this principle 
as Darwin has defined it. 

Hollingworth’s statement of the principle is that specific and less 
widespread traits are more variable than those that are more generic. 
Interpreted in this way it means about the same thing as Thorndike’s 
second law that variability is greater in traits peculiar to man and as 
Pearson’s claim that civilized man is relatively more variable than 
primitive man. It is also related to the question discussed above under 
“Recent vs. Ancient.” 

Among the important differences between man and our nearest 
surviving relatives, the gorilla and the chimpanzee, are man’s upright 
posture, longer legs than arms, more uniform teeth, virtual absence 
of hair from most of the body, smaller face, and greater brain and 
intelligence. 

Young human adults do not appear to vary greatly in upright 
posture when standing or walking. Schultz (78, 80) finds less relative 
variability in face, head, trunk and limb measurements in man than in 
some lower primates. Some primates are relatively more variable in 
arm length than man is in leg length.Relative absence of hair from the 
human body and limbs isa striking case of uniformity in a specific char- 
acter. This is especially true of the human back. We have already 
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noted that the human brain and CAVD intelligence are not strikingly 
variable. On the other hand, our internal organs are generic and some 
of them are very variable. Visual and olfactory acuity are quite generic 
and are also quite variable. 

All members of the Order primates are alike in having one head, one 
trunk, four limbs, two ears, one tongue, etc., but if we are studying 
quantitative variations in characteristics—and that happens to be our 
present problem—we find some of the highest coefficients of variation 
for characteristics which are common not merely to the members of a 
genus but to the members of an entire family, order or class. The dis- 
tance between the nipples of human females has a V of about 20 
(92, p. 108). This is a mammalian characteristic and is much higher 
than the V’s for most linear measurements. Wide variations do occur 
in a single species, especially in domesticated animals which have been 
subjected to controlled breeding and selection, but even here the 
members of a breed of dogs or chickens tend to be alike rather than 
different. However, if less widespread traits are more variable than 
more generic traits it would seem that a single breed of dogs or chickens 
should show very wide variation in those characteristics which are 
peculiar to the breed. 


Less Relevant vs. More Relevant 


Some writers have held that the variability of traits is inversely 
correlated with their biological relevance. This means that the less 
important the trait is for survival the more it may be expected to‘vary 
in the species. This seems to be based on the apparently reasonable 
deduction that the greatest variability cannot exist in things which are 
closely related to life. Against this is the fact that some of the internal 
organs are highly variable. Sensory capacities are essential for environ- 
mental adjustments but some of them are very variable. In lower ani- 
mals many examples could be given of characteristics which seem to 
have little or no survival importance but which are rather uniform in a 
species. “‘A survey of the characters which differentiate species (and 
to a less extent genera) reveals that in the vast majority of cases the 
specific characters have no known adaptive significance’ (Robson and 
Richards, 76, p. 314). Yet it seems obvious that if these characters were 


not reasonably uniform in a species they could not be used to differ- 
entiate species. 


Less Used vs. Most Used 


According to Hollingworth (41, p. 78), ‘“Infrequently used traits 
are more variable than are traits or activities more constantly em- 
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ployed.’”’ As a psychological principle we might make something of a 
case for this ‘‘law’’ in connection with the acquisition of skills in the 
individual. This problem will be discussed under the effects of practice. 


The Effects of Practice 


Thorndike formulated and defended the idea that acquired traits 
are more variable than native ones. This is tied up with a lively con- 
troversy over the question as to the effects of the environment, including 
education and practice, on individual differences. Given certain indi- 
vidual differences, how will these be affected by adding certain variables 
in the form of practice or training? 

If we consider the total “‘scores’”’ arrived at by tossing m dice to 
represent ability before training and that the effects of practice could be 
represented by adding to these original scores some new scores secured 
by tossing p additional dice, each child’s achievement after practice, 
could be represented by the total of n+p dice. If this situation holds 
it follows from previous discussions that the normal effect of education 
and practice is to reduce relative variability: 

We have evidence, however, that our m and p variables in school 
work are not independent variables but are negatively correlated. The 
dullest students in school are prodded most and work hardest. Fre- 
quently the brightest students are permitted to loaf. Thus May (55) 
in a study of college students found a correlation of —.35 between 
intelligence and the time spent in study. Drake (17) gave tests to 
college classes in biology and in history at the beginning and at the 
end of the semester, converted the raw scores to standard scores, and 
subtracted scores on the first test from scores on the second test to de- 
termine gains. He found negative correlations between first scores and 
gains and between intelligence and gains. Kelley (45) challenged the 
then current conception and held that formal education especially at 
the elementary school level reduced individual differences in several 
respects. Meltzer and Bailor (56) tested psychology students at the 
beginning and end of a course and found men decreased from 40 to 21 
and women from 49 to 14 in relative variability in knowledge of the 
subject 

One method of studying the effects of practice and training is to 
compare some of the results secured from intelligence tests and achieve- 
ment tests. If these results are expressed in terms of age or grade 
norms and the range from the tenth to the ninetieth percentile points 
is determined we find that variability in achievement is usually less 
than variability in intelligence. For example, at the age of ten years the 
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tenth to ninetieth percentile range on the revised Stanford-Binet test 
is 4.15 years (85, p. 40). On the Stanford Achievemert Test, Cornell 
(11, p. 87) finds the corresponding range to be 3.4 years. This method 
largely avoids the difficulty imposed by the fact that the tests do not 
have true zeroes. The results do not justify the conclusion that or- 
dinary school training increases individual differences within the trained 
group. 

In controlled practice experiments gross gains are usually positively 
correlated with initial scores, but percentage improvement and initial 
scores are negatively correlated. Individuals with lower scores make 
relatively larger gains than those with larger scores. 

Kincaid (46) reviewed 24 experiments on the effects of practice and 
found negative correlations between initial scores and percentage im- 
provement in 22 of the studies. In 19 of the 24 experiments larger S.D.'s 
were found after practice than before, but in 16 of the 24 cases V’s were 
smaller than before. Both the S.D. and the M usually increase with 
practice but the latter increases more, with the result that relative 
variability usually decreases. 

Reed (73, 74), reviewing practice studies to 1931, concludes that 
in 77 per cent of the studies (V=70) V decreases with practice, and in 
93 per cent of the studies (N=58) the correlations between initial 
performance and per cent improvement are negative. 

Burns (8) reviews 84 practice studies and agrees essentially with 
Reed in his findings. He suggests that one of the important reasons for 
differences in the results of practice studies is to be found in differences 
in motivation. 

Additional practice studies have been made since 1937 but they have 
not changed the general picture nor have they made it clear why in 
some cases relative variability increases with practice. A recent study 
by Yoshioka and Jones (108) of stylus maze learning reports that with 
practice there is a sharp increase in relative variability in errors but 
not in time scores. In the latter case the work unit was constant and 
time scores decreased as learning progressed. In both cases then V’s 
have been computed for scores for which zero would be a perfect score 
(though impossible in case of time scores). Yet, as was noted earlier, 
we cannot legitimately use V scores unless the scale has approximately 
at least a true zero. 


A hypothetical example will clarify the difficulty. Children given a spelling 
test have an average of 80 right, 20 wrong, S.D.=5. After practice they test 
96 right, 4 wrong, S.D.=2. V’s based on “rights’’ are 6 before practice and 2 
after practice. Based on “errors,’’ V's are 25 before practice and 50 after prac- 
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tice. Clearly the true variability in knowledge of spelling is not represented 
by a V of 50 after practice. For this reason before V’s are computed all scores 
which show improvement by decreasing scores must be converted to a form in 
which increasing ability is represented by increasing scores. Otherwise the V 
scores may show the opposite of the true situation, though V’s based on time 
scores are usually more nearly correct than V’s based on error scores. 


Relatively few experiments have shown statistically signific2nt in- 
creases in V’s after practice. Further study of the exceptions is needed 
to explain why they give different results. If we took 100 individuals, 
divided them into four equal groups of equal initial ability and then 
gave distinctly different amounts of practice to each group, relative 
variability for the total group would normally be greater after practice 
then before. When environment operates differentially in this way, as it 
apparently does at times, relative variability might be increased. Those 
with higher intelligence tend to remain in school longer so that amounts 
of practice are distinctly unequal. This should produce a wide spread 
in achievement along particular lines. But to measure this fairly we 
must have tests with true zeroes and with equal units. 


Forgetting 


To a considerable degree forgetting is a reversal of the process of 
learning. In terms of our dice illustration it is possibly similar to what 
would happen if we started with “‘scores’”’ based on throwing n dice and 
subtracted from each score the scores resulting from throwing m dice, 
m being less than m. Apparently there is a low negative correlation 
between learning capacity and rate of forgetting. The mean score 
after forgetting would be lower than the original score and the standard 
deviation would be larger (109, p. 211). From this it follows that if the 
dice illustration holds, we should expect an increase in relative varia- 
bility after forgetting. 

Tilton (91) reviewed 39 studies of the effects of forgetting on indi- 
vidual differences and found that in 24 cases absolute S.D.’s increased. 
The ratio of the average S.D. after forgetting to the average S.D. before 
forgetting was 106.7 to 100. The average V after forgetting divided by 
the average V before forgetting gave 165.6. In the majority of the 
cases then experimental evidence has shown that forgetting does in- 
crease relative variability. Forgetting tends to return a group to the 
greater relative variability which existed before practice. 

Watson (97) reports that after forgetting relative variability is 
greater for recall than for recognition. Probably this is related to the 
fact that power of recall is lost more rapidly than power of recognition. 
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Effect of Fatigue 

Fatigue, like forgetting, involves a loss of function. It differs in 
that it is more immediately and acutely related to physiological func- 
tions such as circulation and respiration. Short and long work periods 
may differ because the short work period may not be greatly influenced 
by “second wind”’ while the long experiment is so influenced. 

Wells (100) had 10 subjects tap for 30 seconds and compared the 
first 5 seconds with the last 5. Under these conditions, with a short work 
period, a reduction in relative variability was found. 

With long work periods, increases in relative variability have usually 
been found. Weinland (99) had 10 subjects work with the ergograph 
over a period of six months and reported that V increased with fatigue. 
He concluded that increased variability was due largely to loss of con- 
trol. Manzer (54) had 27 college men work to exhaustion on the ergo- 
graph and found that relative variability of work with fatigued muscles 
was 309 per cent of what it was with unfatigued muscles. Philip (69) 
had twelve subjects work about seven hours at tapping until they were 
very fatigued. He found an increase in relative variability with 
fatigue and like Weinland attributed this largely to loss of control. 
Edwards (18) reports the effects of the loss of 100 hours of sleep on 19 
subjects. He finds no great differences in V’s for controls and experi- 
mental group but does find that the extremes in reaction times are in 
the experimental group. Flugel (24) had 46 children do addition for 20 
minutes daily for 46 days. He reported that fatigue and ability were 
found to be negatively correlated. This should give an increase in rela- 
tive variability. 

Some studies have reported qualitative changes such as doing the 
right thing at the wrong time. Marked increases in irritability are also 
reported. The unfatigued individual is evidently more stable and 
predictable. 

One of the technical difficulties in the study of fatigue is found in the 
fact that long fatigue tests are monotonous. Hence what is classified as 
fatigue is probably partly boredom and lack of effort. This factor 
would tend to produce variation in the results of different experiments. 


Age Differences 


Vernon’s theory that variability decreases with growth has been 
mentioned. V’s based on data from Montague and Hollingworth (59) 
for length and weight of infants at birth are about 6 and 15 respectively. 
For young adults the corresponding figures are about 4 and 12, based 
on Army results for men (Table II) and on data from Doll (15, after 
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Smedley) for women. Thus far then the principle seems to hold. 
However, there is a prepubertal acceleration in growth which results in 
a rise in V’s at that period. The greatest prepubertal or pubertal rela- 
tive variability in boys occurs at about 14 years and in girls at about 
12 years. After birth then, V’s for height and weight fall, then rise, then 
fall to maturity. In young adults relative variability is lower than at 
birth. 

Sensory and psycho-motor functions follow varying patterns with 
respect to relative variability during growth. No simple generalization 
seems to cover them. Relative variability in visual acuity rises from 
about 12 years to maturity. Relative variability in throwing a bal! de- 
clines during adolescence in boys but seems to increase slightly in girls. 

Henmon and Livingstone (37) collected data from different sources 
and studied changes in relative variability of psycho-motor and mental 
functions during growth. They found relative variability to decline 
with age to maturity, and did not find any consistent evidence that 
relative variability increases at the prepubertal period. 

Israeli (44) finds decreasing variability in aesthetic judgments with 
age to maturity. Miles (58, after Price) reports data on visual percep- 
tion, span and speed which show a decline in V’s with age to maturity. 

V’s based on group intelligence test results generally show declines 
with age to maturity. Freeman and Flory (25) report the results of a 
ten year study with the VACO Tests. This covers ages 8-17 inclusive. 
V’s decline regularly with age without evidence of a prepubertal bulge. 
Lincoln (50) reports V’s for the Yerkes-Bridges, the Pressey, and the 
Dearborn intelligence tests. All show declines with age. Odom (60) 
collected data for the Otis, the National and the Illinois intelligence 
tests. They also show declines in V’s with age. Adkins (1) finds that 
on retests with the Otis and with the Morgan tests V’s decline from 
grade 7 through grade 12. 

The fact that these tests do not have a true zero leaves some doubt 
as to the exact significance of the results reported. However, analysis 
of mental growth curves has convinced the writer that there is a real 
decrease in the relative variability of intelligence during adolescence. 

The Stanford-Binet is not a suitable instrument for measuring 
changes in relative variability with age. It is used on the assumption 
that the S.D.—and hence V—remains constant at 16. Yet the original 
report by Terman and Merrill (85, p. 40) and later studies by Good- 
enough (30) and Brown (7) show marked declines in S.D.’s from 2.5 to 
6 years, then a marked rise to 12 years, and a decline thereafter. This 
is a prepubertal bulge in relative variability, but whether it is due to 
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variations in growth or to the nature of the test is not clear. This is of 
practical importance because it means that a child who is 2.5S.D.’s 
above average may flucutate 20 points in IQ simply by following the 
normal growth pattern for such children. 

The available evidence indicates that relative variability of intelli- 
gence declines during adolescence, but we cannot yet state with any 
certainty the exact changes that occur from birth to maturity. 

After maturity the general tendency is for relative variability to 
increase. Tests with an important speed factor show marked declines 
in the averages with age while those which emphasize power rather 
than speed are on the average less affected. However, Gilbert (28) 
reports losses in memory in senescence which show V’s up to 87. This 
is for retention of Turkish-English vocabulary. For the age period 
from 20 to 29 years the corresponding V is 23. 

The results secured by Miles and Miles (57) which showed a decline 
in intelligence in old age were secured with a speed test. On this the 
S.D.’s change little from 30 to 70 years but the mean scores decline 
greatly and V’s increase correspondingly. In another study already 
referred to above, Miles (58) submits data from Price showing M’s and 
S.D.’s from age 6 to old age. For visual perception, span and speed 
at 6-7 years V =32, at 20-24 years V=13, and for 75-79 years V = 34. 
Goldfarb (29) found an increase in relative variability in reaction times 
in males in old age but the changes in females were not reliable. The 
marked increase in relative variability in visual acuity with age has 
already been referred to. V rises above 90 at the age of 60 in men (77, 
p. 85). The V for auditory acuity also rises with age but to a lesser 
degree. 

There appears to be a fairly definite tendency for relative varia- 
bility to increase after early maturity, but there are great differences in 
the changes in different functions. An extreme increase is found in visual 
acuity while strength of grip shows only a slight increase. 


Variability and Change 


Darwin held that both rudimentary and highly developed struc- 
tures are unusually variable. Schultz (79, p. 321) makes a related 
suggestion: “‘Upon finding such a high variability in the human ear 
we are justified in suspecting that this structure is undergoing some 
evolutionary change, in the form of either an increase or a decrease in 
size."" These conclusions, added to the finding of differences in relative 
variability associated with age changes, seem to justify this tentative 
conclusion: The relative variabilities of both structures and functions tend 
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to be positively correlated with rates of change, both phylogenetic and on- 
togenetic, and both progressive and regressive. 

Schultz cites human wisdom teeth and little toes as examples of 
widely variable structures which seem to be in process of regression. 
We found sight and smell relatively more variable than hearing. Smell 
has been undergoing involution while sight has been evolving. 

This hypothesis agrees well with what is known about individual 
growth and decline. Quantitative individual differences are produced 
by differential growth rates—of which the IQ is one index. Relative 
variability is high in the foetus and in infancy when growth is most 
rapid. Growth rates and relative variabilities both decline with age 
until the prepubertal period. At this point physical growth is accel- 
erated and the corresponding V’s rise. From this point, both growth 
rates and V’s decline to maturity. Measurements of mental growth 
have not shown the prepubertal acceleration found for physica! growth, 
and V’s for mental functions do not rise at this point. As mental func- 
tions begin to decline with advancing age the corresponding V’s in- 
crease. Visual acuity declines very rapidly between 40 and 60 years 
and V rises above 90. 

High relative variabilities connected with change can in part at least 
be attributed to differences in the timing of changes in rates of growth 
and decline. Richey (75, p. 67), speaking of physical growth during the 
prepubertal and adolescent periods, comments: 

In general, it may be stated that measures of variability increase during the 
period that growth is comparatively rapid. A large part of the variability found 
for any particular group is probably due to differences in the periods of ac- 


celeration and retardation of the growth rates of the individuals making up 
the group. 


The prepubertal spurt in physical growth occurs earlier in some 
children than in others, even among those who will as adults be the 
same height. And decline in old age occurs earlier in some than in 
others. There are also differences in the age at which growth stops. 
Altogether these differences in rates of growth and decline and in the 
timing of changes of rate are responsible for a large part of variability. 


Sex Differences 


Scientific males have fairly commonly credited males with greater 
variability and have used this to explain the larger number of male 
geniuses. 

Pearson (66) assembled a considerable mass of statistics dealing with 
sex differences in physical measurements and concluded that human fe- 
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males are more variable than males. Pearl (65) collected data for about 
40 comparisons of the sexes on physical measurements and his re¢ults 
show that V’s for females are higher in a ratio of about 3 to 2. Todd and 
Lindala (92) made about 60 measurements of both white and Negro 
males and females, a total of about 120 comparisons. V’s for females 
are larger in a ratio of about 3 to 1. From these results it seems neces- 
sary to conclude that adult human females are relatively more variable 
than adult males in the majority of physical measurements. However, 
the average difference is usually small. 

It is generally accepted that females mature physically earlier than 
males. The average difference in age at the time of arriving at maturity 
is probably two or three years. And in discussing age differences we 
concluded that, with the exception of the prepubertal period, relative 
variability in height and weight at least decline with age to maturity. 
It follows that if we compare the heights and weights of boys and girls 
of the same age the girls are likely in the majority of comparisons to be 
relatively less variable simply because they are more mature. Their 
lower relative variability is a mark of greater maturity and not of a 
sex difference in relative variability. This possibility is recognized by 
Lincoln (50, p. 164) who says that variability may be a function of 
maturity. Since most of our statistics have been of school children we 
seem thus to have succeeded in reversing the true picture and have 
erroneously concluded that males in general are relatively more vari- 
able than females. If it were not for sex differences in the ages at which 
prepubertal and pubertal accelerations in growth take place, it is prob- 
able that males would more consistently be relatively more variable 
than females during the growth period. 

On intelligence tests at the adult level we do not know accurately 
how the sexes compare either as to the average level of ability or as 
to variability. Prevailing opinion is that the average level is about the 
same, but that there are average differences in ability to score on spe- 
cialized tests. 

That there is a sex difference in intelligence at ages 11,12 and 13 is 
shown by Table V. Scores on different tests have been made compara- 
ble by this method: the test score for 11-year-old girls is changed to 
11 and the test score for 13-year-old girls is changed to 13. The other 
scores are then converted to this scale by proportion. 

Terman (84, p. 66) does not report the numbers of cases for each 
age and sex but he gives a total of 905 cases for ages 5 to 14 inclusive. 
This is an average of 45.25 cases for each age and sex group. Hence I 
have used 45 in the table. The numbers of cases given for Pyle’s test 
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TABLE V 


SEX DIFFERENCES IN INTELLIGENCE AT 11, 12 AND 13 YEARS As DETERMINED 


By NINE INTELLIGENCE TESTS 














Age 11 Age 12 Age 13 
Test 

N Mean N Mean N Mean 

National (101) B 613 10.08 

G 643 11.00 
Illinois (101) B 155 10.51 171 11.27 166 12.13 
G 175 11.00 184 12.19 157 13.00 
Stanford-Binet, B 45? 10.27 AS? £4.71 45? 12.83 
1916 (84) G 45? 11.00 45? 12.03 45? 13.00 
National (53) B 98 10.33 89 11.69 89 12.16 
G 88 11.00 98 11.96 67 13.00 
Pyle (71) B 73 9.71 80 10.42 82 11.44 
G 73 11.00 89 12.07 73 13.00 
McCall (9) B 94 10.80 102 11.18 97 12.10 
G 98 11.00 101 11.82 84 13.00 
Pressey (70) B 179 10.41 182 11.26 174 12.48 
G 167 11.00 180 11.76 174 13.00 
Army Alpha (10) B 22 10.44 34 11.55 35 11.58 
G 33 11.00 31 12.12 35 13.00 
VACO (25) B 150 10.83 163 12.01 149 12.83 
G 176 11.00 168 12.03 144 13.00 
Yerkes ef al. (107) B 33 10.91 34 12.11 28 Zito 
G 29 11.00 32 12.68 26 13.00 
Weighted Means B 1453 10.38 900 11.42 865 12.29 
G 1527 11.00 928 12.01 805 13.00 

Adjusted B 3218 11.367 
Weighted M’s G 3260 12.000 


Difference and standard error 


.633 + .0464 
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are averages based on his table~ for seven different tests. The score 
for boys on the National Test reported by Whipple is arrived at by 
counting 15 points as 1 year—this from the test norms. On the Yerkes 
Point Scale, all cases from 11 to 12 are grouped at 11, and so on. The 
final difference score, called the adjusted weighted mean difference is 
found by adding 1 year to the 11 year scores and by subtracting 1 year 
from the 13 year scores. This groups all cases at 12 years. Distribution 
S.D.’s are assumed to be 16 IQ units in standard error calculations. 
Hence for 12 years the S.D. used is 1.92. 

This method gives the boys a mean score of 11.367 and the girls a 
mean score of 12. The difference, .633, is equal to 7.6 months. Taking 
three standard errors below and above the difference gives us .494 and 
.772. These are equivalent roughly,to six months and nine months. 
Hence we conclude that for the above battery of tests there is a sex 
difference in mental maturity of from six to nine months at the age of 
twelve years. 

Something should be said about results not included in the table. 
Whipple reported National test results for four cities for children aged 
11 years. In two of these cities a special effort was made to test every 
child of this age. This apparently was not done in the other two cities. 
I have included the two cities where all cases were tested and have ex- 
cluded the others. Wechsler-Bellevue and Revised Stanford-Binet 
results are not included because an effort was made to eliminate sex 
differences when the tests were standardized. Dearborn Test results 
are excluded because Lincoln (50, pp. 166 ff.) has shown that the test 
content unduly favors boys. As a result, it, like the two tests just men- 
tioned, gives about equal scores for the sexes. 

A battery consisting of these three tests would show the sexes about 
equal. So far as I know, no similar battery of general intelligence tests 
now in use in this country will show boys mentally more mature than 
girls at these ages. At the high school level, boys are likely to test higher 
than girls because the boys are more drastically selected. 

If girls test higher than boys at age 12, andif the sexesarrive at the 
same final average level, this shows that the zirls are maturing earlier. 
Since we found that relative variability decreases with age to maturity 
it seems that we confront about the same situation with respect to 
intelligence as that found for physical traits. When age comparisons 
show girls with lower V’s, the correct interpretation seems to be that 
girls are more mature, and, being more mature, they are consequently 
relatively less variable. When the boys arrive at the same maturity 
level, the V’s for them will have decreased. To ignore the maturity 
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TABLE VI 


Is DEFINITELY SUPERIOR 

















Test N Sex M S.D. V 

Artificial Language (19) 3236 M 22.8 11.0 48 
2632 F 29.5 12.9 44 

Arithmetic (19) 3236 M 29.6 12.5 42 
2632 F 21.9 11.2 51 

English (19) 3453 M 49.9 18.1 36 
2880 F $5.3 18.4 33 

Mathematics (19) 3453 M 34.7 14.9 43 
2880 F 25.2 12.3 49 

Geometrical Constr. No. 8 (93) 540 M 10.7 3.4 32 
703 F 8.9 4.3 48 

Tonal Intensity (53) 100 M 81.8 3 10 
100 F 77.3 11.3 15 

Tonal Memory (53) 100 M 75.5 11.8 16 
100 F 59.7 16.1 27 

Beta 2, Cube Analysis (103) 1161 M 13.12 4.80 37 
1160 F 10.06 4.08 41 

Beta 5, No. Checking (103) 1161 M 24.42 8.15 33 
1160 F 27.72 8.70 31 

Conservatism-Radicalism (82) 181 M 23.8 9.0 38 
206 F 26.9 8.4 31 

Unpleasantness (Sense of) (82) 187 M 31.6 16.7 53 
202 F 34.6 15.8 46 

Throwing a Ball (15 yrs.) (23) 117 M 134.8 24.18 18 
138 F 73.6 18.5 25 

Grip Strength (22.5 yrs.) (77) M 82.16 11.86 14 
(20) F 51.00 10.20 20 


Mechanical Comprehension (2) 
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variable and to draw conclusions about adult sex differences from age 
comparisons of boys and girls leads apparently to error, and possibly 
to the reverse of the truth. 

We have found that relative variability usually decreases with de- 
velopment. On this basis, if females are better than males in Trait A 
and are poorer in Trait B, they should, other things being equal, be 
relatively less variable in A and more variable in B. The largest sex 
differences on the American Council Psychological Examination are 
found for artificial language and arithmetic. On the Iowa High School 
Content Test the differences are greatest for English and mathematics. 
Table VI supplies the California Junior College norms for these tests 
as reported by Eells (19). To these results I have added geometrical 
construction (Touton, 9."), tonal intensity and tonal memory (Mc-, 
Nemar and Terman, after Church, 53), Army Beta 2 and 5 (Winsor 
103), conservatism and sense of unpleasantness (Skaggs, 82), throwing 
a ball (Espenschade, 23), strength of grip (Ruger and Stoessiger, 77; 
Elderton and Moul, 20), and mechanical comprehension (Bennett and 
Cruikshank, 2). V’s have been calculated or checked by the writer 
and are given to the nearest whole number. 

The majority of these V’s are too high because of the lack of true 
zeroes. V’s for throwing a ball are too high because, as Hill (40) points 
out, the distance travelled by a projectile varies as the square of the 
initial velocity. Hence theoretically we should take the square root of 
the original measurements before computing the value of V. However, 
these defects affect both sexes in the same way, so that our comparison 
of the sexes is valid even though the V’s are too high. 

An examination of the table shows that the hypothesis holds in all 
cases. However, no claim is made that this is always true. It should 
be said that this table is made up of characteristics which show the larg- 
est sex differences in averages. If a random selection of traits were 
studied, most of the differences would be small, and as a result of errors 
of sampling and of measurement a much less consistent result should 
be expected. In any case, the hypothesis seems to work often enough 
to justify the suggestion that adult women will more commonly be 
relatively less variable in those things in which they are better; while 
men wil! be relatively less variable in those things in which they are 
better. 

Absolute variabilities are less consistent. In six out of fourteen cases 
the higher mean goes with the lower S.D.; in the remaining cases the 
higher mean goes with the higher S.D. . 

As related to the question of genius, absolute variabilities seem 
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more important than relative variabilities, because, as was stated in 
the preceding sentence, in eight cases out of fourteen, the higher mean 
is associated with the higher S.D. However, Table VI is in most cases 
not based on the performances of mature adults, and measurements for 
making a fair comparison of adults are not available. 


Race Differences 


The V for opinions on this subject is very large, but the problem 
must remain unsettled until we have more measurements. 


Differences within the Individual 

Trait Variability. We have mentioned Hull’s estimate (42) that on 
the average the different traits of a single individual vary 80 per cent 
as much as a single trait varies in a group of individuals. Instead of 
trying to answer the question in that form we can give a more certain 
answer if the question is put more specifically. 

The amount of trait variability in the individual depends on the 
intercorrelations between the traits in question. Ghiselli (27) has sup- 
plied a formula for computing the extent of trait variability from the 
average intercorrelation between traits. Most investigators have found 
that the intercorrelations between simple motor functions are near 
zero. This means that trait variability of motor functions is about 100 
per cent of the variability of a single trait in a group. This is verified in 
a study made by Owens (61). As the intercorrelations between the 
traits studied increase, trait variability within the individual decreases. 
Presumably the more nearly we are able to measure pure and inde- 
pendent traits the lower the intercorrelations will be and the greater 
the trait variability. 

Measurements of complex functions, achievement tests included, 
usually involve considerable overlapping with the result that individual 
variability is correspondingly reduced. Gray (31) reports that the 
average range of individual variation on six achievement tests is two 
S.D.’s. This means that individual variation on such a limited battery 
of tests is about 35 per cent of group variation. 

From the foregoing it appears that the amount of trait variability 
found in any particular study will depend largely on the battery of 
tests used. Any general average for all psychological traits would 
under present conditions be somewhat arbitrary and tentative. 

Owens (62) studied the effects of practice on a group of motor traits 
and found that they did not become more alike with practice. This 
seemed to indicate that motor trait differences were due to innate 
causes. 
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The question has been raised as to whether there are differences in 
trait variability at different levels of ability. Among the studies of 
this problem are those by De Voss (13), Hertzman (39), Bown (5), and 
Gray (31, 32). Their results are conflicting and incone'usive. However, 
Garrett (26) has shown that the correlations between such functions as 
verbal, numerical and spatial abilities decrease with age to maturity. 
From this it would follow that the amount of trait variability would be 
positively correlated with the level of ability. The failure of other 
workers to find clear evidence of this trend may be attributed to either 
or both of two factors: the abilities tested were too complex or the age 
range of subjects was too limited. 

Quotidian Variability. Woodrow (104) uses the term ‘quotidian 
variability’’ to cover variations in level of performance from day to 
day. Owens (61) uses the term “repetitive variations’ and includes 
under this the systematic changes due to learning. In a study of motor 
skills Owens finds repetitive variations to be about 13 per cent as great 
as individual differences, and he attributes 90 per cent of the repetitive 
variations to learning. This makes quotidian variability unimportant 
in his study. 

Elliott (21) finds that strong motivation decreases variability of 
performance on practice tests in rats. From this we can infer that a 
part at least of quotidian variability is due to differences in concentra- 
tion and effort. This, of course, is what we should expect. 

Variations in the results of achievement, intelligence, personality 
and other tests show that individuals do vary considerably from day 
today. The unreliability of questionnaire tests is well known. So is the 
unreliability of teachers’ marks. All of these depend largely on quo- 
tidian variability. 

Paulsen (63) points out that coefficients of reliability obtained by 
the split-half technique are higher than those obtained by the test- 
retest method. This difference is due to fluctuations in the strength 
or efficiency of the traits themselves. He proposes to measure this by 
correcting the test-retest reliability for attenuation. This will indicate 
the highest possible test-retest reliability. The Spearman-Brown cor- 
rection formula does not apply to test-retest reliabilities, and in a later 
study of steadiness (64), Paulsen finds that the highest possible test- 
retest reliability is about .80. This means that the factors contributing 
to quotidian variability in steadiness are much more important than 
Owens found in his study of motor abilities. 

A related study of trait consistency on the behavior side has been 
made by Trawick (94). He finds consistency in performance to be an 
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important indicator of the integration of personality. The more con- 
sistent individuals are generally more self-confident, ascendant, have 
higher self-esteem, are more goal-seeking, more sociable, more predict- 
able, objectively more modest, and have more social insight. From 
this point of view, the study of quotidian variability involves us in the 
problems of unstable personalities. 

This should be a profitable field for future study and particularly do 
we need to learn more about the limits of test-retest reliabilities. 


Discussion 


If we arbitrarily define a superior person as one who scores three 
sigmas above average, the superior person is about twice as good as the 
average in relatively simple capacities such as visual and olfactory 
acuity. As we descend the V scale the superior person’s ratio of supe- 
riority decreases. On tapping speed the superior person is about forty 
per cent better than average. On stature he is only about twelve per 
cent above average. On level of CAVD intelligence he is only about 
eight per cent above average. Also if total ability is considered to be 
the sum of different more specialized abilities, the superior individual 
will deviate less from the average in total ability than he does in some 
specialized abilities. On this basis probably we should not be surprised 
when we find that a successful politician shows more superiority in the 
particular characteristics required for winning votes than he shows later 
in the way of general administrative ability. 


GENERAL CONCLUSIONS 


Much of the confusion found in conflicting statements about 
relative variability is due to the use of different methods, to failure to 
discriminate between absolute and relative measures of variability 
and to the use of the V formula with scores that are far from having 
true zeroes. 

Too many of the “laws’’ and broad generalizations about relative 
variability proposed by different writers will not stand critical examina- 
tion. More complex, higher and more recently developed functions 
tend to be relatively Jess rather than relatively more variable. As 
judged by relative variabilities, complex mental functions seem to be 
the sums rather than the products of simpler functions. 

Practice usually reduces relative variability. Fatigue and forgetting 
usually increase rciative variability. Changes in relative variability 
w'th age differ greatly according to the trait under consideration but 
with a definite tendency for relative variability to decrease with age 
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until maturity and to increase thereafter. Sex differences in relative 
variability are small, but greater relative variability is more often 
found in boys. This probably means that girls of a given age are more 
mature than boys of the same age and not that females are generally 
relatively less variable than males. Adult females are relatively more 
variable physically than males. In general, low relative variabilities 
indicate superiority. Each sex tends to be relatively less variable in 
those traits in which it is superior. 

Relative variabilities are determined largely by rates of growth and 
decline and by differences in the timing of changes in rate. 

The per cent which intra-individual trait variability is of individual 
differences ranges from 100 downwards, depending on the traits tested. 
The study of quotidian variability is largely in the exploratory stage. 
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A DISCUSSION OF SOME CAUSES OF OPERATIONAL 
FATIGUE IN THE ARMY AIR FORCES* 


LESSING A. KAHN 
University of Pennsylvania 


DEFINITIONS 


According to the Flight Surgeon’s Handbook, operational fatigue is 
defined as a predominately emotional condition found in air crew 
personnel as a result of abnormal strains being placed on normal in- 
dividuals. 

... flying stress or operational fatigue is used to describe a condition that 
may be oserved as an abnormal flying strain being placed on a normal indi- 
vidual (10). 


Its prevalence was marked in members of crews engaged in combat 
flying. The dangerous and extreme nerve-racking character of combat 
flying acted as overwhelming stresses upon many so engaged. 

The supposition that “flying fatigue’’ is in any strict sense to be 
construed as being equivalent to the more general and more subtle 
syndrome of operational fatigue is, in my opinion, without basis; 
notwithstanding that such fatigue may enter as a factor contributing 
to the generation of operational fatigue. Spiegel notes (12) that the 
term ‘‘flying fatigue’’ as used to describe a clinical state may be mani- 
fested by any one of the following symptoms: anxiety, depression, 
anorexia, dreams, agitation, tremors, loss of confidence, regressiveness, 
and many others that are psychological in origin; fatigue of this sort is 
not a reaction to flying per se, rather to the conditions under which 
flying takes place. Hastings (5) makes a most accurate and clear 
differentiation between these two types of fatigue: 

Flying fatigue... means ordinary fatigue and the physical and mental 


symptoms of it and does not imply that the individual is emotionally sick. 
Flying fatigue is the same as the fatigue any individual would suffer if he had 


* The author wishes to express his appreciation to Prof. Samuel W. Fernberger for 
reading this paper and offering many helpful suggestions. 

(Editor’s Note: Mr. Kahn served in the Army Air Force as a combat navigator with 
the rank of ist Lieutenant. He was stationed in Italy with the 450th Heavy Bombard- 
ment Group, 15th Army Air Force, from July until September 1944, when he was shot 
down in combat over Vienna on September 12th. He was then interned as a prisoner of 
war in Germany where he had the opportunity to collect case history materials con- 
cerning the acute emotional conditions considered in his review.) 
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insufficient sleep, rest, and relaxation, and had been exposed to the nervous 
strain of flying. 
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Operational fatigue . . . is used to describe a typical syndrome of breakdown 


occurring in essentially stable individuals, who by continued stress, harrowing 
experiences, and physical fatigue develop an illness which is roughly half fatigue 
and half emotional illness. 






SYMPTOMS 


The most common symptoms of operational fatigue lie within the 
broader context of anxiety states (4)—in which the individual is still 
attempting to deal with a situation that has long since been eliminated 
from the present sphere of environment; in short, the individual is 
attempting to deal with reactions to combat. Moreover, according to 
Hastings as regards to the direction in which aggression can spread: 


. .. schematically one can hit back at that which threatens one, one can hit 
at other people, at inanimate things or at oneself . . . [projection, dissociation, 
introjection, etc.] 

In a study made of combat fatigue, etc., 29% of successful combat men state 
with feeling that they have a personal hatred against the enemy, that they want 
to kill the men in the enemy fighters—at least when they are in the air. More 
than 70% state that they develop irritability and quick flaming anger at their 
crew, mates, in a way entirely foreign to their usual feelings and actions, as their 
operational tour progresses. Such feelings of animosity develop most charac- 
teristically during periods of inactivity, when there is little active outlet for 
their feelings. The monthly dances held at groups bases have been marked by 
a number of violent fights. For example, on one occasion two squadron com- 
manders, quite close friends, riding back to the post after an evening of moderate 
drinking, amicably and deliberately decided they “needed a fight,”’ and without 
any quarrel of any sort, got out of the car and fought violently until one broke 
a metacarpel bone, after which they amicably climbed back into the car and 
drove home. 

In order to relieve tension, it was not uncommon for men to shoot out the 
lights with a tommy gun or to shoot one’s initials into a wall with a pistol. The 
feeling was that of just blowing off or smashing something. 

Several men have reported, upon close examination, that they have seduced 
women, not for any sexual gratification, but for the sake of subduing and con- 
quering their defenses. 

Enlisted men’s barracks are the scenes of consistent and brutal attacks upon 
the equanimity of new crews coming into them. The old combat man describes 
to the new men the appearance of a man’s brains scattered about a plane by the 
action of a 20 mm shell, and the like (5, pp. 137-139). 


The persistence of anxiety, brought about by flying incidents in 
which personal security and safety has been menaced, is characterized 
by a breakdown in adaptive mechanisms. This is manifested by the 
individual’s failure to relax and be free of tension when he is no longer 






36 LESSING A. KAHN 


confronted with the combat situation. Grinker and Spiegel (4) give 
us a clear picture of the clinical syndrome of anxiety: 


Severe anxiety states result in an intensely striking, unforgettable picture. 
Terror-stricken, mute, and tremulous, the patient closely resembles those suf- 
fering from an acute psychosis. The facial expression may be vacuous or fearful 
and apprehensive. There are course tremors of the extremities. Speech is 
usually impossible except for a few stuttering attempts to frame an occasional 
word. Sudden fits of crying or laughing may occur without reason. Behavior 
is extremely bizarre, and attitudinizing with apparently senseless gestures, al- 
ternates with periods of excessive activity, characterized by running about the 
ward and leaping over the beds. Terror is one of the principle themes of the 
patient’s behavior. He resembles a frightened inarticulate child, with only a 
few persistent “‘islands’’ of his past well-organized behavior. 

Mild anxiety states in contrast to the severe anxiety states present uniform 
and sometimes monotonously similar clinical pictures. Upon going into battle, 
anxiety appears in gradually increasing amounts. At first it is kept under con- 
trol through an effort of will. Under the continued stress of battle, of near 
escapes from death, of constant exposure to anti-aircraft fire, the anxiety over- 
whelms him. He develops gross persistent tremor and feels weak, as if his legs 
would carry him no further. He becomes dizzy so that intelligent thought is 
impossible. 


The picture of the mental state of these men is typical and strikingly 
uniform as to symptomatology (3, 4, 5, 6, 9, 12). On missions the 
individual is usually tense, jittery, restless, and inefficient. In the 
presence of flak or enemy fighter opposition, he becomes overly appre- 
hensive and fearful of the entire combat presentation. He complains 
that the airplane will be shot down, that he will have to bail out, that 
the plane will be shot up to such an extent that escape for all will be 
impossible, or that he will be hit by flak or other missiles. In many 
cases, free-floating anxiety is experienced along with the usual physio- 
logical concomitants of anxiety, such as, dizziness, nausea, feelings of 
weakness, high blood pressure, rapid pulse, extreme perspiration, and 
the like (4). More severe symptoms may become evident such as head- 
aches, vomiting, fainting spells, etc. Needless to say, the appearance 
of these symptoms of fatigue during the course of a mission not only 
endangers the entire organization, but summarily affects the over-all 
efficiency of the individual concerned. In the more severe cases, a state 
of fear and terror develops to the point where bailing out becomes the 
only possible solution. Levy (7) cites several interesting examples of 
patients who lay upon the floor of the plane to prevent themselves from 
giving in to the impulse of bailing out. In between missions, sleep is 
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disturbed, dreams and nightmares centered about combat experiences, 
increased tension and irritability, etc., are the rule (4, 5). 

The appetite is impaired and weight is usually lost (3). Excessive 
drinking and smoking as an attempt to relive anxiety is common; and 
the need for finding sexual outlets is intensified by the factors of fatigue 
and anxiety of these individuals (4, 5). A frequent complaint (12) and 
at the same time a symptom is the loss of confidence in the ability to 
perform the job with responsibility, and a conflict occurs with regard 
to ultimate consequences to the remaining crew members. This brood- 
ing results in further conflict and anxiety, which has its most telling 
effect on an already weakened ego with ever increasing feelings of 
insecurity and loss of balance. 

The following case cited by Grinker and Spiegel illustrates for us the 
developmental aspects of the fatigue syndrome: 


A twenty-four year old airman had demonstrated fine skill as a pilot and was 
much respected for his aptitude and judgment. During a night mission, the 
bomber plane in which he was flying co-pilot became lost. The pilot cruised for 
hours looking for home base, until finally the gas ran out and he instructed the 
crew to parachute to safety. However, he had misjudged the altitude. Four 
other members of the crew got out safely, but the patient had considerable 
difficulty getting out of the pilot’s compartment to the escape door. There was 
much delay and loss of precious time while he struggled to get through; finally 
by a super-human effort he made it and jumped, leaving the pilot still in the 
plane. Because of the low altitude, his parachute had just opened when he 
landed on rough ground, injuring his back. A few minutes later he heard an ex- 
plosion accompanied by a tremendous flash of flame; obviously the pilot had 
crashed some distance away. Although he was concerned for the safety of the 
pilot, nothing could be done that night because of the complete blackness. 

The following morning he rounded up the other members of the crew who 
were safe in the area, and set out to search for the plane and pilot. They 
found the pilot’s body, a charred and smashed mass, among the twisted 
wreckage of the plane. Clearly, the pilot had not had time to escape before the 
plane crashed. Eventually the men were picked up by friendly natives and 
made their way back to base where the patient was hospitalized because of the 
back injury, which consisted of a fractured transverse vertebral process. In the 
hospital he at first exhibited marked continuous anxiety, associated with tremor 
and terrifying dreams, in which he saw himself falling from a plane and crashing 
on the hard ground. Considerable depression and grief for the dead pilot, who 
had been a good friend of his, was manifested. The grief was accompanied by 
some conscious guilt because of the patient’s delay in escaping from the plane. 
This had not actually been his fault; but had it not occurred, the pilot could 
have jumped in time to save his life. In time the anxiety and depression dis- 
appeared to be replaced by a pronounced conversion symptom related to the 
back injury. The lesion had healed; but the patient continued to complain of 
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the severe pain. At this point hospitalization was discontinued and the patient 
returned to his unit in the hope that he would give up the conversion symptom, 
when back in the environment of his friends and active operational duties. He 
was given ground duties as an operations officer, and after a short time the 
conversion symptom disappeared, to be again replaced by a slight anxiety and 
depression. The patient felt very badly because his friends and class-mates at 
flying school were participating in missions while he remained on the ground. 
Accordingly, in spite of the mild anxiety, he asked to be restored to flying 
status. 

After a short period of training, during which he did very well, he was as- 
signed a crew and thereafter participated in seven bombing missions over Tunis 
and Bizerte, each time going through heavy flak, which severely damaged his 
plane. He had no increase of anxiety in relation to these missions, and felt con- 
fidence in himself. On the eighth mission, however, his tent mate and best friend 
was shot down in flames over the target. The patient brought his own plane 
back uneventfully, but after his return he had an intense recurrence of anxiety 
with tremor. That night he had the first recurrence of the anxiety dream of 
falling. The next day he continued to have anxiety, but determined to fight it; 
he said nothing and went on a mission. However, he had so much anxiety that 
the experience was like a nightmare and he could scarcely keep his mind on the 
job. In addition, he was harassed by a specific phobic apprehension that the 
plane was falling off to the right—the direction in which he had seen his friend’s 
plane fall. This phobia persisted, in spite of the evidence of his senses and the 
instruments that the plane was in level flight. So much did this obsess the 
patient that he almost crashed the plane on landing, because of miscalculation, 
greatly upsetting the crew, who reported the situation to the Flight Surgeon. 
The patient confessed the recurrence of anxiety to the Flight Surgeon, who 
referred him for psychiatric evaluation, and it was at this point that we first 
saw the patient. 

In the hospital he exhibited moderate anxiety, tremor, and dreams of falling, 
with a marked phobic response to every aspect of flying. He could not think 
of planes without strong anxiety. Because he had enjoyed flying, this reaction 
gave him considerable depression . . . (4, pp. 122-126). 


In sum, the crux of the problem of adjustment to the combat situa- 
tion is the degree of success which the individual is able to maintain in 
order to cope with the manifold factors effective in the combat situation, 
namely, threats to the individual’s personal safety and security. The 
initial phases of the fatigue syndrome are constituted of mild anxiety 
states, in which the individual is attempting to reconcile his continua- 
tion in combat and maintaining his own personality intact. When such 
attempts are uniformly unsuccessful and prolonged, the resulting anxi- 
ety states become more intricate, and implications therefrom become 
more inclusive symptomatically speaking. On the other hand, severe 
apprehension of the combat situation is the most likely extreme to be 
manifested by those individuals relatively free from anxiety and other 
fatigue symptoms. 
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CAUSES 


Armstrong (1) lists the occupational causes of fatigue as follows: 


I. Physical Agents Il. Emotional Stresses 

a. Heat a. Physical discomfort 
b. Cold b. Responsibility 
c. Improper clothing c. Attention 
d. Vibration d. Concentration 
e. Glare e. Alertness 
f. Noise f. Apprehension 
g. Wind g. Anxiety 
h. Acceleration h. Fear 
i. Barometric changes 

Ill. Deficiencies IV. Toxic Agents 
a. Oxygen (anoxemia) a. Carbon Monoxide poison 


Not only does combat flying involve the many factors given above, 
but more comprehensive ones such as geographic and climatic condi- 
tions, enemy installations, and opposition in active combat. Among 
all of the factors contributive to operational fatigue, my experience 
has shown the emotional stresses to be most potent. Such other 
causative factors as deficiencies, toxic agents, geographic, and climatic 
conditions, etc., I have found to be only incidental to the really dispos- 
ing factor of emotional stress. 


Physical A gents 


In a consideration of combat stresses which dispose crew members to 
the sundry forms of operational fatigue, physical factors which operate 
during a flight play a decisive role. Missions are flown at high altitudes 
and in close formation to afford a maximum amount of protection from 
enemy opposition. Oxygen masks, electrically heated suits, steel 
helmets, and specially built armor suits (flak) are worn on all missions. 
Pilot and crew members must suffer these physical discomforts for long 
periods of time and the effects of these precautionary devices are telling. 
Constant opposition from enemy fighter aircraft, the noises of bursting 
machine guns, canon, shell hits, flak bursts and hits, the humdrum of 
engines being throttled back and forth, the rush of the wind, the noise 
of radio and interphone equipment, and a host of other physical events 
all contribute to the general emotional and physical tenseness of the 
situation on hand. The following report by Hastings (5) vividly por- 
trays some of these conditions present on a combat mission: 

The airplane, a B-17, on a mission over a distant target in enemy-occupied 


Europe, had most of its controls shot out by attacking planes before reaching 
the target, so that the ship was knocked out of formation. The pilot, however, 


40 LESSING A. KAHN 


with the exercise of great skill and strength, persisted in making an effective 
bomb run. Following this, the lone airplane was attacked by about 100 FW-190 
fighters over a period of perhaps three-quarters of an hour, during which time 
extraordinary damage was done to the plane and crew. Virtually all of the 
crew were wounded, three severely, and one became anoxic as a result of the 
simultaneous explosion of a 20 mm. cannon shell next to him, and the severance 
of his oxygen system. Almost all of the control cables were cut in various 
places, the oxygen, hydraulic, and electrical systems were knocked out, the inter- 
phone and radio systems destroyed, a small fire started in the bomb-bay, large 
holes were put through both wings, holes were in both propellers, bomb-bay, 
fuselage and nose; the tail assembly received so many direct cannon hits that 
it vibrated violently, and, after inspection by the flight engineer, was expected 
to tear off entirely at any moment. 


We must be cautious, however, not to localize any of these stresses 
and claim them to be the only causative factors operatirg to produce 
fatigue. Thus, in discussing physical agents as participating in the 
production of operational fatigue, we shall also note that each and every 
one listed is highly charged with emotionally disturbing aspects. 

During the first five missions most crews would have encountered the har- 
rowing experiences which were the normal events for heavy bombers operating 
from this theatre [Eighth Air Force stationed in England]. Watching close in 
and constant enemy fighter attacks, flying through seemingly inpenetratable 
walis of flak, seeing neighboring planes go down out of control, and at times 
explode in mid-air, returning with dead... (5). 


High altitude flying in itself constituted one of the greatest hazards 
of combat flying—the most important aspect of which was the constant 
need for oxygen in proper amounts in order to function at maximum 
efficiency. The intense cold and the danger of frost bite were constant 
reminders to all crew members of the latent dangers at hand. 


My first two or three missions bring to mind the extreme uncomfort of high 
altitude flying. Equipped with inadequate heated suits, I would lie huddled in 
the forward end of the nose of the ship, where I could at least observe the 
instrument panel and record with frozen fingers the progress of the plane. | 
frankly confess that I was completely oblivious of all that was going around 
me, that is, fighter and flak opposition. Under these conditions, I managed to 
perform my duties, although at a minimum of proficiency. _ 


Glare played an important part to offend crew members, especially 
pilots and gunners who had to be at all times at top operational ef- 
ficiency regardless of such disturbing factors. The great feeling of mu- 
tual dependence which crew members held for one another often 
caused them to endure the most trying of hardships, such as flying into 
the sun and holding such a formation position until the tactical situa- 
tion changed. Gunners constantly had to scan the skies and be on the 
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alert for oncoming fighters—enemy fighters that purposely held the 
sun behind them in attack so as to hamper our own gunners. 

Whatever the rationale operative in these physical conditioning 
factors, one point is outstanding, namely, while they may not in them- 
selves be sufficient to bring about a partial or complete personality 
breakdown, yet their effect was quite apparent. 


Geographic and Climatic Conditions 


Morale and general fitness of crew personnel, preparatory and sub- 
sequent to combat duty, played a vital role in maintaining top ef- 
ficiency. The need for stimulating and diverting environmental changes 
was recognized early’ in the war, mainly due to the many combat 
reports which stressed such factors as positive measures in the pre- 
vention of fatigue and other mental disorders. 

Tropical or sub-tropical climates with their heat, humidity, and 
frequent rainfall were novel to fliers, and had definite effects upon them 
—producing run-down physical conditions, loggy feelings, and lazy 
attitudes (3). Semi-civilized countries, such as was the case in the 
South Pacific area, that lacked even a semblance of modern sanitary 
facilities, constituted a constant menace to the health of the men. 
Because of geographical location and difficult supply lines, food and 
nutrition in general became another problem in keeping the morale and 
general fitness of the men at a high level. Repetitous diets with little 
or no variation acted to make men disgruntled and malcontent, and 
made for the development of unwholesome attitudes with respect to 
their duties. Dougherty reports: 

During the month of November 1942 physical fitness in the pilots under- 
went a rapid decline. There was an acute shortage of fresh fruits, meats, and 
vegetables, and the foodstuffs that were available contained many gas forming 
foods. Combat missions had to be flown in the morning and in the afternoon, 
and for a period of three weeks the chief food for the noon meal was a stew 
which was highly seasoned and which contained considerable gas forming 
elements. As a result there was a great increase in minor gastro-intestinal com- 
plaints such as heart-burn, gastric-distress, and at times nausea. The constant 
repetition of certain foods caused a distinct distaste for them and a very definite 
loss of appetite, and the food was not eaten. Consequently a diet which was 
satisfactorily balanced became one which did not meet the nutritional demands, 
and malnutrition characterized by weight loss developed. Vitamins were avail- 
able in the foods and in the supplemental multi-vitamin capsules; however, 
vitamin C was inadequate and remained so throughout the period. Fortunately, 


some fresh food supplies arrived in the early part of December, and the physical 
condition of the men improved (3, pp. 36-37). 


In the European theatre of operations, the over-all situation was not 
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radically different from that in the South Pacific. For example, I found 
sanitation facilities in South Italy (from which a greater proportion of 
combat missions were flown) were none too good. Veneral diseases 
were high because of the highly infected population living close by to 
operational fields, and recreational facilities in near-by towns were in 
in many instances limited due to the war-torn state of affairs. Toward 
the end of the war, the pace of battle progressed so rapidly that men 
were not certain about where they were going to sleep from night to 
night. The constant demands of battle tactics kept most flight per- 
sonnel constantly on the move without much opportunity for recreation 
of a too varied sort. 

In short, factors which were in many instances completely out of 
the control of army commands often contributed towards making the 
lives of combat men monotonous and void of any diversion from the 
nerve-wracking business of combat. This lack often, while not acting 
as the precipitating cause of operational fatigue, certainly contributed 
important aspects to its development and fruition. 


Emotional Stresses 


Probably the most important aspects of the personality structure 
of combat personnel were those to which we commonly refer as emo- 
tional. In any given combat situation, physical and environmental 
stress void of any emotional contributing elements is without meaning. 
The effect of enemy gun fire, for example, produced considerable ten- 
sion and is in one sense to be looked upon as a physical stress; yet in a 
stricter sense the emotional ‘‘charge’’ which such a condition produced 
was more manifest and effective. Among these conditions the following 
are important: 


1. Enemy aircraft may be encountered and may result in seriously wounded 
or killed crew members; or ‘“‘bailing out’’ or crash landings in enemy territory; 
flying over mountains, over jungles, and over water. 

2. Geographic and climatic conditions, such as missions through storms, 
overcasts, and all sorts of altitude conditions that conjure up all sorts of possi- 
bilities as to the final outcome of the mission (for example, getting lost, crash 
landing, running out of fuel, parachuting, and the like). 

3. Base conditions, that is, location of bases with reference to enemy opera- 
tions; how frequently they are subject to attack by enemy bombers, strafing 
raids, and the like. 

4. And finally, each and every member of a crew feels responsibility toward 
his country and the principles for which he is fighting; that is, the men feel 
the necessity of accomplishing a mission not because they are given credit on 
the ‘‘board”’ for it but because they see it as a stab at the enemy. Their par- 
ticipation means that many more bombs. Members of a crew hold themselves 
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responsible for the fear they manifest, not so much of the enemy fighters or flak 
that they undoubtably encounter as of what their buddies and squadron 
organization will think of them. 


The requisites of a battle situation suggesting any of these afore- 
mentioned possibilities produce extreme emotional stress in all con- 
cerned. These are the problems which properly set the stage for emo- 
tional conflicts—conditions and situations in which the factors of en- 
vironment, physiological and emotional stresses, and so on are mutually 
reinforcing. 

Responsibility. The responsibility which command members of a 
flight crew feel for subordinates is probably one of the most potent 
motivating factors operative in personal relations. Does a pilot, for 
example, consider himself adequately responsible for the safety and 
well being of his crew? On non-operational flight duty, there is little 
emotional stress on the part of a pilot in this one respect. There is 
actually little need for it, since such missions do not call upon the pilot 
to exercise anything above ordinary operational control over the 
members of his crew. However, in battle, the situation is quite differ- 
ent. In addition to routine flight command responsibilities, he must also 
display judgment about the battle situation, he must effect decisions 
of paramount importance, bearing in mind at all times his responsibility 
to the command and its mission (of which he is a vital part) and to his 
crew members. At times, the emotional burden which this responsibility 
imposes is exceedingly heavy, and there are many instances of pilots 
(as well as other members of a crew) breaking under these demands. 
Wrong judgments are feared most of all, for the decision formulated in 
battle must be quick and accurate if the crew is to survive. Supposedly 
the selection of pilots for combat has been determined by a man’s 
capability of exercising such responsibility without serious psychiatric 
consequences. However, this selection is not always effective for as 
we have mentioned earlier, it often happens that apparently stable indi- 
viduals break unexpectedly with disastrous consequences for all con- 
cerned. 

I recall from my own experience several first pilots who resigned their posi- 
tion to accept a co-pilot’s position because, as they claimed, they did not wish 
to assume the responsibility of nine crew members. They felt that they did not 
possess the necessary requisites for making quick decisions, for evaluating 
combat situations, and the like, contending that a wrong move on their part 
would bring serious consequences for which they did not want to feel everlast- 
ingly responsible. 

I also recall in connection with this discussion the incident involving a 
B-24 pilot who had lost a good portion of his crew by the exercise of a wrong 
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judgment, namely, he pulled his ship out of formation with 3 engines in good 
condition, and thereby became an easy target for enemy fighters. 

The question of responsibility does not apply to the pilot alone, but 
extends to all members of a flight crew. The navigator is responsible to 
his crew to see that they reach their target and that they have a safe 
return route. He must be cautious with regard to his computations, 
the accuracy of the ship’s course, and many other details for which he is 
solely accountable. Mistakes on his part may not only affect the men 
in his own ship, but in many cases (e.g., when he is in the lead ship of a 
bomber formation) he is performing for a great number of ccher ships. 
One can readily realize the tremendous emotional stress which a navi- 
gator must undergo who is entrusted with these responsibilities. 

A navigator, during the heat of an intense fighter attack, abandoned his 
position and in great fear and anxiety fell to the floor of the plane for protection. 
The plane was crippled as a result of the attack and it became separated from 
the rest of the bomber formation. The navigator, by his action, had failed to 
keep track of the bomber’s course and it turned out subsequently that the pilot 
brought the plane over a flak area and was shot down. The navigator’s neglect 
to perform his duty can be attributed not only to his fear and anxiety, the 
resultants of the battle situation, but to a definite lack of responsibility toward 
his crew mates. The literature is replete with cases in which men have been 
overcome with fear and anxiety, but who have refused to give up their position 
because of the responsibility that they had toward their buddies. 


The bombardier is held accountable for dropping his bombs on the 
target, not only that the mission might be considered successful, but 
that repeat missions will not become necessary and thereby further 
endanger a large force of men and planes. Bombardiers, because of the 
stresses of combat conditions, become emotionally upset, so much so, 
that repeat runs on the target become necessary—this entails one or 
more additional runs on the target that may very well be heavily de- 
fended by flak. Anxiety and fear, under these circumstances, are magni- 
fied considerably in all personnel concerned. Bombardiers who dis- 
played these characteristics soon became known and stood out in the 
squadron; their unfavorable reputation often caused them to become 
shy and seclusive; to complete the picture, the typical symptoms of a 
war neurosis became evident. 

On a bombing mission directed at a rail-bridgehead located in northern 
Italy, I recall the following details pertinent to our discussion. This was a 
very heavily defended target, since it was a main line of supply for the German 
forces in Italy. ‘he task was to bomb the bridge and thereby hamper the 
supply flow. The group bombardier had had considerable experience in combat. 


It turned out that three bomb runs were made on this target; losses were much 
higher than necessary. The feeling of tension release which one experiences 
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after making a bomb run (in getting out of a flak area) cannot be minimized. 
In repeating the bomb run twice, with the enemy’s flak becoming more and more 
accurate on each succeeding trial, I venture to say that not a small number of 
men would have gladly shot the lead bombardier. 


Gunners in all positions were held responsible for defending the ship. 
This task was very important for interference from enemy aircraft 
could and often did cause a mission to fail. Gunners, therefore, held 
a unique position in preventing harm from coming to their buddies and 
were indirectly responsible for the success or failure of the mission. 

The following details of a mission were reported to me. Returning from a 
mission, it was necessary for the group to cross the Adriatic Sea in order to 
reach their base of operations in Italy. It was standing operating procedure 
that upon reaching the sea, all gunners would be relieved, since there was then 
little danger of attack by the enemy. It was therefore customary to raise the 
ball gunner. However, on this particular occasion, for some unknown reason, 
the other gunners whose responsibility it had been to raise and release the 
ball gunner failed to do so. The ship ran out of fuel and it had to be ditched. 
It was too late to do anything about the trapped gunner. 

A second case concerns a nose gunner in a B-24. It was customary upon 
approach to the bomb run area for each crew member to don his flak suit. In 
the case of pilots, nose, and ball gunners, they are helped into their suits by 
nearby crew members. In the case of the nose gunner, it is the responsibility of 
the navigator to do so. Upon reaching the target area, the gunner began 
screaming over the interphone for his flak suit. The navigator was busy chart- 
ing the bomber’s course. It was the navigator’s feeling that at the time he was 
in no position to give attention to anything other than his job. An 88 mm. shell 
burst just ahead of the ship and a piece of flak killed the gunner. 


In the case of the fighter pilot, one would be inclined offhand to 
think that they had little responsibility to exercise—perhaps, only in 
the sense of completing a,strafing mission, or a dive bombing mission, 
or just escort for a group of heavy bombers. Upon closer examination, 
however, it will be found that their responsibilities required as much of 
them as was the case with bombardment personnel. 

Tactical expediency in the case of fighter formations r-cessitated 
the use of two-plane elements. There was a lead plane and a wing ship. 
In this manner, it was difficult for an enemy plane to sneak up behind 
one without the other being aware of it. In combat, the iead plane did 
most of the aggressive fighting and the wing plane afforded protection. 
Credit for destroying enemy aircraft went to the lead ship by virtue 
of its function. Honors in battle were strongly vied for and intense 
jealousies were evident between lead pilots and wing pilots, since it was 
held that undue credit was being given to the lead pilot. The need, how- 
ever, for this arrangement of planes in combat was manifestly neces- 
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sary for the safety of all concerned and the necessity for having a 
responsible wing pilot was very apparent. Unfortunately for both pilots, 
the wing pilot often forgot this responsibility and sought after a little 
glory of his own, with the result that both planes would suffer the almost 
inevitable consequences. 

A second responsibility of fighter pilots was manifest on cover or 
protective missions, that is, in escorting heavy bombers and protecting 
them from enemy fighter opposition. The need for this cover was es- 
sential and bomber personnel were always very grateful because of it. 
Failure on the part of fighters in meeting the bombers at the proper 
time led to serious consequences. A bomber could not weave about the 
sky in wait for its cover, since fuel consumption was a vital problem in 
every attack and target time was strategically set. The result of any 
failure of fighters and bombers to coordinate was evidenced by the 
high bomber losses incurred on the raid. 

Fighter pilots were always attracted to grounded enemy planes and other 
land targets. Their failure to meet pre-arranged schedules with bombers was 
often due to this fact; taking time to strafe grounded craft, supply trains, and 
other targets delayed their time to such an extent that for all practical purposes 
they were useless. 

In sum, responsibility as an emotional stress entails a number of 
factors. It consists primarily of an alertness, apprehension, and atten- 
tion to the many details that go to make up the job that has to be 
performed. It is not just a realization that so much has to be accom- 
plished; rather it isa knowledge of the necessity of performing that job. 
It consists of a concentration of energies and attention upon the rami- 
fying points entering into performance of duty as a whole. Failure to 
achieve any of these components ultimately leads to a conscious disa- 
vowal of responsibility or unconsciously to gross mistakes that in the 
final analysis must be interpreted as failure to act responsibly. The 
end product of all of this is, of course, some sort of mental conflict. 

T..e concept of responsibility may be looked upon from another 
point of view, namely, lack of responsibility as a function of or symptom 
of an already disterted personality. Thus, as was the case many times, 
crew members who had already acquired conflicts of one sort or another, 
such as, anxiety states, fears, and the like were by reason of their con- 
dition unable to act in a responsible manner. They were not only a 
danger and detriment to themselves and their crew mates, but in a 
larger sense to the entire formation of which they were a small part. 

A pilot failed to pull out of formation after being hit by a flak burst. It 
was standing operating procedure that when a ship was seriously hit and there 
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was some danger of explosion, that the pilot was obliged to get the ship out of 
the formation in which it was flying. This pilot neglected to do so; the result 
was that he blew up and caused a number of other ships to go down with him. 
Upon subsequent investigation, the following facts concerning the pilot were 
brought to light: he had somewhat over 35 missions; he had been of late speak- 
ing of his extreme fear and anxiety of being hit by flak and catching on fire; he 
had exaggerated anxiety symptoms before being briefed, and had expressed 
the hope before briefing time that the mission for the day would be an easy 
one; and the like. There is little doubt that this pilot was mentally incompetent 
at the time he was flying the mission. 

The weight of responsibility, its conditions and demands was at 
times so highly charged that emotional stability was made impossible. 
On the other hand, responsible behavior could not be exacted from some 
individuals since they had already succumbed to some form of. person- 
ality disorder. 

Fear. It was frequently asserted by combat personnel that there 
was no one man who at some time or another did not experience the 
harsh feeling of fear before or during a mission. The truth of this state- 
ment becomes apparent when one takes into account the many mani- 
fold experiences to which the combat man was constantly subjected. 
There is then little wonder that we list fear as one of the major stresses 
operating at all times to distort and disorder the personalities of combat 
men. It is not so much the fact that fear operates independently of any 
other stresses already mentioned or to be mentioned; rather it is the 
unique manner in which it manifests itself during the daily routine of 
flight personnel which constitutes fear a special type of problem. The 
common experiences of flak, gun fire, explosions, and the like were in 
themselves great dangers and were capable of producing fear reactions; 
however, in a great number of instances, anticipatory fears of situations 
produced more marked reactions (5). Perhaps it was the mystery con- 
nected with these possible episodes (e.g., bailing out, ditching, being 
strafed, etc.) which made them all the more fearful. 

The development of fear follows a definite course; a course which 
parallels the combat career of every person so engaged. In a number of 
cases, persons responded to fear in new and resourceful ways and eventu- 
aliy overcame it; in other instances, it was fear that won out. Hastings 
describes the development of behavior in which fear plays a most im- 
portant part: 

1. On arrival at an operational station the men were first of all insecure and 
defensive, both consciously and unconsciously, and this was apparent in many 
ways. In action they were either overly self-assured, or particularly diffident, 
usually the former. In speech they might be either loud and continuous, or in 
a few cases ‘‘mouse-quiet.” They either spoke continuously of combat or 
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avoided it completely. They either ridiculed and paid no heed to any advice 
by experienced men or they took in every possible word of it. They tended at 
first to drink more than the others. They did not accept the possibility that 
they would ever be afraid, and openly spoke of the older men who mentioned 
fear as being “‘flak-happy”’ or spiritless. It was quite easy to spot a new group 
of officers at a table in the mess or in a group in the lounge, even if one knew 
none of the personnel of the station. 

2. These defensive attitudes and mechanisms began to disappear quite 
quickly after the men had four or five raids, and had seen and felt the real 
factors in the combat situation. This process was frequently conscious to a 
large degree, and in many cases, the men spoke of the change in themselves, 
and shamefacedly deprecated their former cocky attitude. At about this point 
it was frequent for them to ‘‘over-swing,” and to be very conscious of their 
anxiety, having somatic symptoms, and paying attention to them, and feeling 
quite hopeless about their chances of survival, sometimes in consequence, get- 
ting careless of technique, equipment and the like. 

3. The third stage of evolution took place at roughly the tenth raid, by 
which time one or more of several factors had helped effect a further change: 
the man had experienced fear and by now knew that he could deal with it; 
he found that care and skill and coolness in the pilot and crew had a real bear- 
ing upon the question of his return; he saw that his crew and his airplane could 
withstand catastrophe; he developed an “esprit de corps’’ in regard to his 
squadron, and was now really part of it. He developed for the first time a 
sense of his responsibility to his mates, and to the formation. At this stage 
which continued sometimes until the end of his tour, the men were effective, 
careful, fighting men, quiet and cool on the ground and in the air. They at- 
tained a sort of tranquility in spite of their anxiety. They had very little need 
for defensive mechanisms of any sort to deceive themselves or anyone else. 
They talked easily and quietly, drank little except on pass, and expended 
virtually all of their attention and interest on the job. When they did go on 
pass and over-indulged they usually did so in a peculiarly deliberate way, be- 
lieving over-indulgence was a cathartic sort of release of feelings, which they 
felt to be useful. They were drained of most feelings other than those having 
to do with combat. No values existed other than those meaningful in combat. 

4. Frequently a fourth stage of evolution gradually took place by the be- 
ginning of the last five raids. Its components were probably to a large extent 
physiological, the results of continuous prolonged fatigue and fear. It con- 
sisted, in its most extensive form, of a state of insomnia, fatiguability, weight 
loss, anorexia, indifference, restlessness, loss of concentration and interest and 
efficiency, marked irritability, loss of libido, and a fairly marked depression 
with retardation. It did not necessarily include all of these components, but 
perhaps only several of them; these symptoms could not be considered in the 
category of neurotic manifestations, in that they did not cloud the real issue 
from the flier’s consciousness. He had usually complete insight into their 
cause and mechanism. They also did not give him any “secondary gain,” in that 
he did not accept the release from combat that they might have afforded (5, pp. 
20-25). 


The element of fear may be the result of any of the many factors 
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encountered in combat flying. Some feared take-offs, for in many types 
of combat ships the stability of the craft decreases with bomb load and 
increased use in battle. Stories became numerous of accidents which oc- 
curred on take-off and the results of these accidents often acted to 
create phobias in inexperienced pilots. An engine may fail on take-off, or 
a tire may blow out as the plane rushes down the runway—with a full 
load of gasoline and thousands of pounds of bombs such incidents be- 
came tragic affairs. And those who have witnessed such happenings 
readily know the mask of fear on the faces of crew members as they 
rushed from the burning plane, which, in a matter of minutes, will be 
blown to bits by the exploding gasoline and bombs (2). 

Night missions were especially feared by most combat men. The 
uncertainty of night flying and its added hazards made for extreme 
tension and anxiety. Such conveniences (considered in the United 
States as common necessities) as lighted runways, guide markers, for- 
mation lights, radio aids, etc. were not routinely employed in combat 
for reasons of security. This lack made night take-offs in flying ex- 
tremely hazardous, and pilots as well as other crew members reacted 
accordingly with signs of fear and anxiety. The proportion of accidents 
between daytime and nighttime flying was consistently higher in the 
night missions. 

On daytime raids, the fear of take-off is just as intense as it is on 
night missions. Perhaps the most critical moment for all crew members 
simultaneously was that of the take-off. One is sure of a certain degree 
of success in connection with the mission as a whole once the ship be- 
comes air-borne. Erroneous as this feeling may be, the psychological 
boost which occurs in combat men once they have become air-borne is 
not to be minimized. Watching a take-off mishap impresses one with 
the extent of helplessness manifested by the crew members involved. 
In a sense, this is not true of the many other dangers connected with 
flying, wherein certain defensive measures can be employed. The alti- 
tude attained on take-off does not allow for freedom of maneuver or the 
use of adequate safety measures, and when an accident occurs the end 
is inevitable: ships are completely blown up or consumed by fire in the 
space of a few minutes. The observer is impressed with all of this, and 
in some respects there is a certain amount of comfort in the rapidity of 
it all. Many combat men become fatalistic after a short time in combat, 
and experiences of such accidents seem to offer some satisfaction in the 
hope that their end, if it should come, might be a similar one. 

That take-off and landing accidents were traumatic situations for 
the production of fear reactions certainly is undeniable and there can 
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be little doubt that almost every combat man at one time or another 
witnessed such occurrences. This being so, the extent to which these 
experiences acted to produce fear and anxiety cannot be over- 
emphasized. But, as we have already mentioned, combat experience 
along with the general personality configuration acted to minimize or to 
emphasize the final effect produced. 

Fear of flak and fighters formed two other bases for fear of a most 
effective sort. At the beginning of a tour of duty, the fear of flak was 
not very apparent; however, when flak came into close range and com- 
bat personnel actually tasted some of its ‘bitter fruit,’ then fear of it 
became in many instances obsessional. The inexperienced flier did not 
for a moment realize the nature and complete effectiveness of this type 
of weapon, and for this reason new combat men tended to under- 
estimate the destructiveness of flak. The more common notions about 
flak were: to be effective, flak had to burst very close to a plane; that 
enemy flak batteries were not sufficiently accurate to score direct hits; 
and that there were not enough flak batteries around any single target 
area to be cause for alarm. Experience, however, very rapidly changed 
such opinions. It was learned that flak, like shrapnel, was very deadly; 
that its range of effectiveness was wide; that the Germans had their 
guns radar controlled and that a fair degree of accuracy was possible; 
and that the number of guns around many targets ran into the hundreds. 
It was not at all surprising to note the extent of change which did take 
place after contact with this weapon. 

It was common practice for new crews to complete their first few missions 
over relatively easy targets, that is, targets which were not heavily defended 
and at which little fighter opposition could be expected. The rationale of this 
procedure was to bring about a gradual introduction to the rigors of combat. 
As the missions became ‘“‘tougher,’’ one could notice a decided change in the 
behavior of the men: they became restless and revealed considerable anxiety 
about the type and objective of their next mission; anticipatory fears developed 
connected with their position in the formation which connoted varying degrees 
of vulnerability by flak and fighters; and, in brief, a great many other sympto- 
matic changes of personality attendant upon fear and anxiety. There were 
many times in my own experience when flak was so heavy over a target that 
the expression “‘you could roller skate on it’’ was a somewhat accurate descrip- 
tion. Such targets as Vienna, Wiener-Neustadt, Munich, Regensburg, etc. 
were reputed to have had many hundreds of flak guns for their defense. 

The accuracy which flak achieved was in many instances outstanding. Guns 
were radar controlled and range estimation was thereby easily and accurately 
computed. With any large formation of bombers, in the time that it took two 
or three groups (a group consisted of approximately twenty-five planes) to 
pass over the target, flak was very accurate and deadly. Thus, when forma- 
tion notices were posted, the men would be very anxious to know in what part 
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of the formation they were to fly. Asa matter of preventive psychiatry, these 
formation notices were often restricted to the time of briefing, so as not to 
cause undue anxiety among the men who were to fly in the rear echelons. 

To repeat, first contact with flak usually did not arouse many fears 
in combat men, for inexperience with the effects of flak, wrong notions 
concerning its actual and potential danger, and the practice of starting 
new men on relatively easy missions were operative in the early stages 
of a combat career. With every mission, however, crews learned what 
flak could do: physical injury by flak was very painful and many times 
fatal; damage to planes occurred in places which one would never im- 
agine to be vulnerable; and most effective was explosion due to a direct 
hit in the bomb-bays which set off the bombs or gas tanks. Combat men 
witnessing mid-air explosions had their fears enhanced considerably by 
such incidents. 

An enlisted man who had completed fifty-two missions had an extreme 
fear of flak, the mere sight of which caused him to be paralyzed with fear. 
Often he would “‘freeze’’ on the toggle bombing switch and on several occasions 
he would not close his bomb-bay doors until called by the pilot to do so. While 
accumulating his fifty-two missions he had witnessed eleven of our aircraft 
shot down; with the eleventh craft were lost two of his close friends. On the 
day he was grounded and hospitalized his own crew was lost to flak—exploded 
over the target. After hearing of this, he expressed an ardent appeal to dis- 
continue flying altogether. (From a report of a Flight Surgeon attached to a 
B-26 Bomber Group). 

In sharp contrast to the reaction to flak was the effect of fighter op- 
position in the production of fear reactions. Fear of fighters was not a 
common experience among crew members. As it was often put, “At 
least one was able to do something about the damn things;’’ this was 
not exactly the case with flak. Those men who did react with fear and 
anxiety toward fighters did so because of the unfortunate and disastrous 
traumatic experiences with them. Formation flying was not only de- 
signed to bring about optimum conditions for bombing a target, but 
also to maintain a maximum of protection against fighter opposition. 
The possibility of becoming disabled for any one of a great number of 
reasons and having to leave the protection of the formation loomed large 
as a potent fear producing stress. But the factor of self-defense in being 
able to fight back gave one a chance to alleviate his fears; the personal 
implications of the threatened danger were minimized through a con- 
structive attack upon the danger itself. 

Toward the closing phases of the war our fighter planes were so 
numerous and the protection so efficient, that enemy opposition directed 
against our bombers became inconsequential. As a result of our superi- 
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ority personal dangers in this direction were considerably lessened, and 
the incidence of fear reactions correspondingly decreased. 

Anxiety. Aero-anxiety ranks next in importance to the factor of fear 
in the production of operational fatigue (8). To repeat, the incidence of 
any of these causal agents often went hand-in-hand and this is especially 
the case with anxiety and fear. Many times it was difficult to differenti- 
ate between fear and anxiety; it was very rare that one occurred without 
the other. This is understandable in view of the nature and conse- 
quences of fear itself. Thus, one of the after-effects of fear is anxiety— 
as manifested in anticipating the same incidents which originally pro- 
duced the fear. With an intensification of the anxiety state there is a 
corresponding increase of fear and a lessening of responsibility. In a 
great measure, all the factors stressed as important in the production of 
loss of responsibility and of fear are likewise operative in the production 
of anxieties. 

SUMMARY 

We have attempted to make clear some of the more important agents 
influencing operational fatigue—defined as abnormal flying strain being 
placed on a normal individual. We have seen that they are very com- 
plex; that they do not occur in isolation, but rather tend to interact and, 
in a number of instances, are all present together. In this respect, each 
becomes a function of every other, and the occurrence of one becomes a 
condition of the eventual appearance of the others. Thus the burden 
of supposedly excessive responsibilities brought on the retardation of 
performance, and prepared for the emergence of fear and anxiety. In an 
analogous manner, experiences which produced fear also brought with 
them anxieties and loss of responsibility. 

The question of predisposition toward personality breakdown has 
not been treated in this paper for the obvious reason that at the outset 
the assumption was made that all personnel engaged in the hazardous busi- 
ness of flying could succumb to operational fatigue. It is debatable, of 
course, whether environmental, climatic, emotional, and similar factors 
are causative in a true sense, or whether they are the triggers that set 
off neurotic behavior in a predisposed personality. Clearly, the litera- 
ture indicates that those men who did break under the strain of battle 
would in a majority of cases not have done so under the conditions of 
peace-time living. The predispositions which they might have possessed 
would have been of little significance in civilian life; however, because of 
the extreme and prolonged types of experiences in combat, they were 
bound to react in violent ways. No amount of psychiatric screening 
could have eliminated these individuals for their condition was in no 
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sense deep-seated enough to allow for adequate and complete airing. 


Rehm (9) concludes this matter nicely: 


background. ... 


It is a matter of general understanding and acceptance by medical personnel 
and laymen that air crewmen are a highly specialized and trained group. In 
recognition of the severe strain to which these men are exposed, diligent care 
has to be taken to see that they are maintained in the finest physical condition. 
However, other factors, less tangible, which have a direct bearing upon the 
ultimate combat efficiency of these men, have been placed more or less in the 


These factors are the psychological disturbances which arise 


in combat flying personnel—the “gremlins” of the mind, in whose grip even 
the strongest willed man is powerless. 
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A REVIEW OF LEADERSHIP STUDIES WITH PARTIC- 
ULAR REFERENCE TO MILITARY PROBLEMS 


WILLIAM O. JENKINS 
Indiana University 


INTRODUCTION 


The present report summarizes and reviews selected references from 
the available literature dealing with the problem’of the selection of 
leaders in various fields. The primary interest in preparing the article 
was to provide a summary of techniques and results that would be of 
value to psychologists dealing with problems of selecting leaders, par- 
ticularly in the military field.* For this reason, no attempt has been 
made to cover the extensive literature concerning dominance and 
“leadership” with other than human subjects. 

The primary factor considered in selecting items for inclusion in the 
present article, in addition to their relevance to military selection prob- 
lems, was whether or not the material was empirical in nature. For 
illustrative purposes a few speculative reports have been included, but 
the main emphasis is on the presentation of research findings. 

No attempt is made to treat the theories of leadership which have 
been proposed. Few of these theoretical expositions have presented hy- 
potheses concerning the various aspects of leadership which are testable. 
Furthermore, none of them has been comprehensive and systematic to 
the extent of accounting for the obtained information. In a late section 
of this article certain general principles and hypotheses extracted from 
the available literature are presented. 

For the purposes of this article the dictionary definition of leadership 
as the act of guiding or directing the behavior of one or more individuals 
may be employed. A more adequate operational definition should de- 
rive from future research on this problem. 

For clarity of presentation the various studies of leadership have 
been divided into five groups: (1) industrial and governmental investi- 
gations, including studies of executives, administrators, supervisors, 
foremen, etc. ; (2) studies of scientific and professional personnel; (3) in- 


* This article was originally published as No. 199 in the AAF Aviation Psychology 
Abstract Series issued by the Psychological Branch, Office of the Air Surgeon, Head- 
quarters Army Air Forces, Washington, D. C. The date of publication was Sept. 20, 
1945. The purpose of the original report was to present typical procedures and results 
obtained as background information for Army Air Forces Aviation Psychologists working 
on problems of leadership. 
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vestigations of the activities of children in pre-school and extra-school 
situations; (4) studies in the school situation; (5) military leadership. 


INDUSTRIAL AND GOVERNMENTAL INVESTIGATIONS 


Two of the earliest studies of the characteristics of business execu- 
tives in this field were published by Gowin in 1915 and 1918 (26, 27). In 
the first study a questionnaire was submitted to approximately 1,000 
executives, 225 lesser executives, and 200 professional personnel. Data 
were also gathered on 222,000 insurance policy holders. Information 
concerning various characteristics of these individuals was requested. 
Measures of statistical significance were not presented, but differ- 
ences between the major executives and the insurance policy holders 
were reported with regard to height and weight, favoring the former 
group, and the executives were reported to have been subjected to some- 
what stricter selection than the professionals as indicated by a lower 
coefficient of variation. The executives did not differ greatly from the 
other groups with regard to age at marriage, number of offspring, and 
similar items. ‘‘Attitude toward life work’’ was rated by the investi- 
gator on a ten-point scale from ten (continued one line of activity 
throughout life) to one (changed line of activity three or more times). 
The results for approximately 200 business executives were compared 
with a control group of ‘‘professionals.’’ No outstanding differences ap- 
peared. 

In Gowin’s second study 276 business executives were asked to rank 
the importance of a number of qualities for administrative ability. The 
qualities ranked at the top were judgment, initiative, and integrity; 
those ranked lowest were refinement, appearance, and sense of humor. 
Laird’s (33) findings were similar to these. No definitions of these con- 
cepts were presented nor was further use of the data reported. 

A number of other studies involved the administration of question- 
naires to business executives. In one study, Taussig and Joslyn (55) 
investigated the social classes from which American business leaders are 
recruited; attempted to determine the proportionate contribution of 
each social class to the supply of business leaders; and proposed to study 
the relative influence of heredity and environment on such disparities 
as might exist between the representation of the classes among business 
leaders and their representation in the population at large. A sample of 
15,000 business leaders was selected from a register of directors and 
complete returns were received from somewhat over 7,000 cases on the 
nine-item questionnaire which was employed. The modal individual 
emerging from the reports of this study may be described as follows: 
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president (for less than ten years) of a manufacturing or mining organi- 
zation, the gross income of which is between one million and five million 
dollars per annum; a background of living in a community with a popu- 
lation over 500,000 in New York State; between the age of 51 and 52 at 
the time of questioning and between 41 and 43 at the time of entering 
the business; a college graduate with no formal business training who 
reports little or no help in starting in business. 

The authors conclude that the results suggest that lack of native 
ability plays a greater role than lack of opportunity in the failure of the 
lower occupational classes to be as well represented in the sample as the 
higher classes. The evidence, however, was negative rather than posi- 
tive in nature. 

Starch (54) sent a questionnaire to fifty executives making salaries 
over $50,000, to 50 making between $7,000 and $50,000, and to 50 with 
salaries below $5,000. In the opinion of the members of these three 
groups, four characteristics of executives were important: ability to 
think, inner drive, capacity to assume responsibility, and ability to 
handle people. The proportion of the high salary group rating ability 
to think as important was 72%; the percentage of the middle salary 
group considering this aspect important was 58%; and the percentage of 
the low salary group was 46%. For the other characteristics the per- 
centages, respectively, were: inner drive 58%, 42%, and 20%; capacity 
to assume responsibility 68%, 54%, and 20%; and ability to handle 
people 84%, 72%, and 86%. On the basis of these results Starch pro- 
posed a formula to account for differences in earning potential where 
“executive achievement”’ was the product of ‘“‘drive’’ and the sum of the 
factors of ‘‘ability to think,” ‘capacity to assume responsibility,” and 
the ‘‘ability to handle people.’’ Only the results of the questionnaire 
were presented, as given above, and no additional data were reported to 
support the use of such a formula. 

An investigation was made by Sorokin (53) of the biographies of 
1,600 leaders of labor and radical movements. Leaders were classified 
according to their socio-economic background and similar factors. The 
findings indicated that the majority of the labor and radical leaders 
come from families in which the paternal occupation is professional, 
business, or managerial. 

A somewhat different survey approach was employed by Likert (34) 
in a study in which the relationship of managerial attitudes to the 
morale of life-insurance salesmen was investigated. In this study, ap- 
proximately 300 insurance agents in twenty different agencies were 
interviewed systematically and each agent filled in a questionnaire con- 
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cerning his attitudes toward the firm, the management, etc. Each agent 
received a morale score based on the interview and one derived from the 
questionnaire, with the two scores correlating .85. 

The agents indicated that the manager was the chief influence on 
their morale. The major characteristics of the manager which were 
considered important by the agents in this respect were: his attitude 
towards them; his personality in general; and his professional skill. 

A number of investigators have attempted to develop test batteries 
to select executives or administrators. Among these may be listed 
Cleeton and Mason (12) whose battery of tests carries the rather elegant 
title of ‘Vocational Aptitude Examination for Sales, Technical, and 
Executive Groups.’’ It was reported that this battery had been ad- 
ministered to personnel in executive positions, but no data were pre- 
sented in this regard. 

Uhrbrock and Richardson (58) administered a battery of nine tests 
to 163 supervisors. Ratings by superior executives were employed as 
criterion scores. Out of a total of 820 items in the nine tests, only 85 
were found to have significant predictive value. The majority of the 
valid items were drawn from company information tests. The following 
personal history items were also significant in this study: age, schooling, 
and military service record. 

In another industrial investigation Bridgman (8) performed a follow- 
up study of the success of 1,310 college graduates in the Bell Telephone 
System. The criterion involved was salary achieved in the company. 
The findings indicated that high scholarship, campus achievement, early 
graduation, and immediate employment in the Bell System were all 
significantly favorable factors for success in this company. Scholarship 
level during the college career appeared to be the most significant factor. 

A study of 100 presidents and vice-presidents of ‘“‘successful’’ com- 
panies has been reported by O’Connor (44). It was stated that five 
characteristics of executives were isolated and measured in the course of 
this research. The five items listed were: (1) large English vocabulary, 
(2) many aptitudes, (3) objective or extremely objective personality, 
(4) accounting aptitude, and (5) aptitude for first position. Descriptions 
of the instruments, conditions of administration and scoring, and data 
supporting this statement were not presented. 

Beckman and Levine (4) studied the Allport A-S Reaction Test, the 
short form of a Personality Inventory (C-2), and a Directions Test in 
an attempt to predict supervisory ability. Various efficiency ratings 
were employed as criteria for an experimental and control group of ap- 
proximately thirty cases each. On the basis of a correlation of .33 be- 
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tween supervisory ratings and scores on the Allport Test in this small 
sample, it was concluded that this test is of value in selecting executives. 

Along these same lines, a study has been performed concerned with 
the administration of a mental alertness test to twenty-eight minor 
executives in a clothing manufacturing establishment (31). These men 
were rated by six superior executives, and it was reported that close 
agreement was found between the test scores and the ratings given the 
men by their superiors. As in the reports of many of these investigations 
the details necessary for critical evaluation were omitted. 

In contrast to these studies reporting appreciable correlations of 
predictive instruments with ratings by supervisors for executives and 
administrators in industry are results reported, among a number of 
others, by Bingham and Davis (7). In this study, an intelligence test 
was administered to 102 business executives, and experience records 
containing personal information about the individuals were obtained 
from 73 of these men and employed as the criterion. No agreement be- 
tween the two sets of data was found according to the authors. 

Thurstone recently studied administrative ability in governmental 
work (57). A group of perceptual tests which had been shown to dif- 
ferentiate a small group of campus leaders from non-leaders was em- 
ployed along with several new tests that were assumed to predict 
administrative ability. In the first phase of this study, a group of in- 
terns in governmental administration served as subjects. The ten 
interns with the highest supervisors’ ratings of professional promise and 
success and the ten with the lowest ratings were selected and their test 
scores compared. The tests which were found to be most discriminating 
were Gottschaldt Figures, Street Completion, Kohs Block Designs 
(negatively), and a Two-Hand Tapping Test. 

In another phase of the study, test scores were co:npared with a 
salary criterion corrected for age. For 127 administrators whose salaries 
ranged from less than $3,000 to $10,000 per year, the best single test 
was the linguistic score on the Psychological Examination of the Ameri- 
can Council on Education. Gottschaldt scores yielded a prediction 
which was almost as accurate. A Classification Test involving categori- 
zation of cards with names printed on them, differentiated in that the 
higher paid administrators used fewer categories and fewer single card 
groups. On a test requiring that numerical estimates be made on the 
basis of common knowledge, such as the approximate population of the 
United States, the more successful men on the criterion did better, par- 
ticularly in terms of the proportion of acceptable answers. The following 
scores on the Allport-Vernon Study of Values differentiated the two 
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groups: social, theoretical, economic, and religious. The successful men 
excelled on the first two and were inferior on the last two. They also 
had more masculine scores on the Terman-Miles Schedule for 
Masculinity-Femininity. On the Thurstone Vocational Interest Schedule, 
the administrators who had higher incomes made lower scores in physi- 
cal science, physical activity, and commercial interests. 

Richardson and Hanawalt (50) compared the Bernreuter Personality 
Inventory scores of 258 business men divided into groups of office 
holders and supervisors on the one hand and non-office holders and non- 
supervisors on the other. Office-holders and supervisors tended to be 
less neurotic, less introverted, more dominant, more self-confident, and 
more self-sufficient than the control groups and the norms. The dif- 
ferences were statistically significant for a majority of the measures. 

The use of another procedure that purports to pertain to the selec- 
tion of executives is illustrated in a German study by Luithlen (35). 
This investigation employed a test in which single words were printed 
on individual cards, there being two sets of cards one blue-bordered and 
the other red-bordered. Subjects were paired and one member of each 
pair was given the blue-bordered cards and the other the red-bordered 
ones. Each pair was requested to construct original sentences with the 
cards. In addition to general observations, the measures included time, 
number of constructed sentences, and color and position of the cards. On 
the basis of initial results, individuals were labelled as leaders or fol- 
lowers, and pairs of leaders and followers were given the same problem. 
It was found that when a leader and a follower were paired, the leader 
did most of the work, and when two followers performed, it took longer 
to reach the result, but there was cooperation. When the individuals 
were both leaders, fewer sentences were constructed. Statistical evalua- 
tion of the findings was not presented. 


STUDIES OF SCIENTIFIC AND PROFESSIONAL PERSONNEL 


Investigations similar to those listed above have been conducted 
concerning the characteristics of outstanding scientific and professional 
personnel. Several studies (11, 47, 51, 60, 64) have dealt with the char- 
acteristics of American inventors and have employed a questionnaire or 
some sort of biographical approach which yielded various normative 
data concerning the characteristics of these persons. 

Typical studies of the characteristics of geniuses, utilizing similar 
methods, are contained in reports by Ellis (16) and Giese (25). Infor- 
mation from these investigations has indicated superiority over the 
general population for the specialized groups with regard to such items 
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as socio-economic background and education. They also agree in giving 
ages 25-35 as those of the first achievement for scientists, inventors, and 
literary men. 

An interesting study has been performed by Schneider (52). The 
hypothesis under investigation in this study was that any group of 
distinguished men formed a “‘birth galaxy” associated with the cultural 
situation which, in turn, is related to a definite period of time affecting 
this entire birth group. The individuals studied consisted of approxi- 
mately 200 English botanists born between 1700 and 1920. On the basis 
of published biographical information concerning these individuals, two 
outstanding periods of activity were uncovered. The conclusion drawn 
from the available data was that there occurred a period in the history 
of British botany when there were fewer persons born who became 
famous botanists than for the immediately preceding or following pe- 
riod. It appeared improbable that any racial factors were involved in 
the decline in the number of botanists from 1700 to 1800 since this 
period was marked by a general rise in the number of births of eminent 
Englishmen. Schneider also found that the fame of military leaders was 
associated with the number of conflicts in which the particular country 
was engaged. 

A study has been reported by Flanagan (19), which pertains indi- 
rectly to leadership, concerning the validity of the 1940 edition of the 
Cooperative Test Service National Teacher Examinations. These ex- 
aminations consisted of the following eleven sections: Reasoning, Eng- 
lish Comprehension, English Expression, Current Social Problems, 
History and Social Studies, Literature, Science, Fine Arts, Mathematics, 
Professional Information, and Contemporary Affairs. 

In order to validate these examinations 49 teachers in 22 school 
systems who had taken the tests were selected in such a manner as to 
maximize the range of scores (20). School supervisors then rated the 
teachers on a graphic rating scale consisting of 50 items, and students 
filled in an educational-report form concerning them. The product- 
moment correlation between the supervisors’ ratings on the character- 
istic of ‘‘overall judgment of teachers’ general effectiveness and desira- 
bility’’ and score on the examinations was .51. Positive correlations 
were found between all of the 50 rating items and total score on the 
tests. The lowest correlations were for the following ratings: teacher's 
health, physical appearance, and poise; energy, enthusiasm, and drive 
in school work; quality of speech and voi¢e; sense of humor; congeniality 
of teacher’s adjustment to associates; neatness of teacher’s work in 
classroom; and integrity of teacher’s character. It was found in this 
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small sample that the Current Socia! Problems section of the test yielded 
the highest correlation with the supervisor’s over-all ratings. 

A correlation significant at the five per cent level was found between 
total score on the examinations and the proportion of the students re- 
porting a particular teacher’s name in response to the question ‘Which 
teachers seem to have a broad knowledge of other subjects besides the 
one you had with them?”’ In addition, the correlation was significant at 
the five per cent level between a given teacher’s score on the English 
Expression Test and the proportion of the same teacher’s students who 
mentioned his name in response to the question, ‘‘Which teachers were 
most clear in presenting their ideas?”’ 


PRE-SCHOOL AND ExtTRA-SCHOOL STUDIES INVOLVING CHILDREN 


The three major techniques which have been employed in the studies 
of leadership among children are: (1) observation, (2) nomination of 
companions for leadership positions, and (3) special test techniques and 
situations for identifying leaders. These three techniques are illustrated 
by several studies in the following paragraphs. 

In a study by Parten (45) nursery school children were observed for 
60 one-minute periods and their behavior recorded. There appeared to 
be a greater difference in leadership ability among the children than 
might be attributed to age alone according to the author. In addition, 
it was concluded that the leaders were above average in intelligence. 

The nominating procedure was employed in a study at the Detroit 
Teachers College (64). In this investigation, more than 5,000 grammar- 
school children were asked to write the name of another child that they 
considered their best friend or would like to have as their best friend and 
the reasons for such a choice. The children were also asked to nominate 
the individuals they would like to have as leaders and present reasons. 
There were some outstanding differences reported between the reasons 
for selecting leaders as contrasted with those for selecting friends. 
Ability and achievement appeared to be the main reasons given for 
selecting a particular individual for a leader, and social characteristics, 
such as good sportsmanship, were much less significant for the leader- 
ship choices than they were for the friendship selections. It was found 
that a small number of children, about two to five in a given grade, re- 
ceived most of the votes. 

In another study involving the nominating technique, Partridge (46) 
studied leadership in a sample of 226 Boy Scouts. The five-man-to-man 
rating technique was employed in which each subject rated every other 
individual in the group on leadership ability in groups of five cases with 
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the names presented in random order. The characteristics of those se- 
lected as leaders were recorded and compared with the ratings. It was 
found that leaders were generally superior individuals in such factors 
as height, weight, intelligence, age, and so forth. Wide individual dif- 
ferences appeared in the characteristics of leaders in different groups. 

Partridge also studied the influence of leader attitudes upon group 
attitudes by having the group fill out a questionnaire and selecting items 
on which differences of opinion existed. A group discussion was held 
with a particular leader in charge and the group was then asked to vote 
again on the questions. The number who changed their opinions after 
learning the attitude of the leader was counted. The results of this 
study showed that the amount of influence leaders had upon group at- 
titudes was greater than that of individuals who were not leaders. The 
differences were statistically significant 

An early study by Terman (56) illustrates the use of special tech- 
niques in studying the problem of leadership. In this investigation 
groups of four children, boys and girls separately, were presented with 
ten pictures and objects on a large card and told that the experiment was 
a memory test and that they could look at the card for ten seconds and 
were then to answer questions concerning it. The answers were recorded 
in the order given. In a second series of tests, different subjects were 
used and the original groups were reformed. Each new group contained 
at least one individual who had been a leader in responding in the previ- 
ous series and one who definitely had not been a leader. An analysis was 
made of the number of first, second, and third responses that were made 
to the stimulus cards in comparison with information concerning the 
children. The results on the small sample involved indicated that the 
leaders tended to be larger, brighter in school work, better iooking, 
better dressed, more widely read, less emotional, more fluent, of more 
prominent parentage, and more daring. 

Typical of a series of researches by Lewin and his associates is a 
study by Bavelas (3) concerning morale and the training of child leaders. 
The study dealt with the retraining of leaders with low morale. Three 
mediocre leaders on a WPA project, as judged by ratings, were selected 
and equated to a control group of three cases on the basis of age, sex, 
length of time on WPA, length of time on present WPA project, rating 
of technical skill, rating of leadership ability and relevant life history 
factors. All six individuals were tested by observing and recording their 
actual behavior with children on the job prior to and after training, and 
by recording the behavior of the children supervised by them. The 
experimental group was trained for two hours a day for a period of 
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twelve days by indicating to them in detail the attitudes, objectives, and 
philosophy of recreational group work. During this time the experi- 
mental and control groups continued their work with the children. Dur- 
ing the fourth week both the trained and the non-trained individuals 
were tested again on the job by the same methods of observation and 
recording. Prior to training, leaders employed an authoritarian form of 
giving commands approximately 60% of the time, and the remaining 
portion of the time responded to the approach of the children. Follow- 
ing training, the experimental group used praise, invested the children 
with responsibility, and revealed their preferences rather than employ- 
ing their previous approach. The number of children working togethe? 
more than doubled for the experimental group while increasing approxi- 
mately 40% for the control group. 


STUDIES IN THE SCHOOL SITUATION 


As in several of the previous areas treated in this report, three gen- 
eral techniques have been employed in studies of leadership behavior in 
the school situation. These are the nominating procedure, the survey 
procedure, and the use of special test techniques. Several examples of 
each of these procedures are given below. 

In a study which illustrates a type of nominating approach to the 
problem of leadership, Nutting (43) had a group of girls list the reasons 
for selecting certain individuals as captains for their gymnasium teams. 
An examination of the characteristics of these captains indicated that 
they were slightly above average in age, physical ability, and intel- 
ligence, and slightly below average in scholarship. They were noticeably 
high in “popularity.” 

The survey procedure is exemplified in a number of studies. Cald- 
well and Wellman (10) found that leaders in different activities ex- 
hibited different characteristics; for example, boys chosen as class presi- 
dents or other representatives were taller than average whereas the girl 
leaders were approximately average in height. Practically all leaders 
were high in scholarship, and although physical achievement was an 
important characteristic of athletic leaders, it did not appear to be rele- 
vant for other types of leadership. According to the results of a rating 
scale for extroversion, most of the leaders were more extrovert than in- 
trovert. Studies by Bellingrath (5) and Brown (9) employed similar 
procedures and yielded comparable results. 

Zeleny (62) tried out several different procedures for selecting 
leaders in a group discussion situation. The first technique tried out was 
“identification by voice and appearance”’ which was reported to be suit- 
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able for speedy preliminary selection of leaders as judged by the super- 
visor of the group discussion. After groups of four or five individuals 
were formed, the ‘‘status-ranking technique’ was used in which each 
individual ranked all others in his group and the average rankings were 
checked against special ratings of leadership by the group and by the 
group supervisor. This technique was reported to show some relation- 
ship between those selected by the group and the professor’s ratings of 
the individuals on leadership qualities. The third procedure employed 
was the five-man-to-man rating technique. In Zeleny’s situation a co- 
efficient of .72 was reported between such ratings and the ratings by 
the faculty leader of the group discussion for a sample of twenty-one 
cases. The degree to which the faculty representative was familiar with 
the ratings of the students was not reported. The final technique was 
known as ‘‘sociometric identification” in which each individual listed 
five choices for leader of the group discussion in order of preference. 
According to the author, by this technique one could determine those 
most chosen and also those with whom both leaders and followers would 
most like to work. 

In another study by Zeleny (63), the five-man-to-man and status- 
ranking techniques were found to yield ambiguous validation data. 

The extent to which several hundred girls desired to live and work 
with one another was rated by each member of the group in an investiga- 
tion by Jennings (29). This procedure yielded a test-retest correlation 
after eight months for positive expression of choice of .65 and for re- 
jection of .66. It appeared that those selected as leaders, to a very 
much greater extent than the average member of the group, construc- 
tively contributed to enlarging the social field for participation of other 
persons. The findings indicated the existence of many individual dif- 
ferences and an overlapping of characteristics between the leaders and 
non-leaders. 

A factor-analysis study of the personality of high school leaders has 
been performed by Flemming (21). The criterion of leadership involved 
the assigning of points based upon positions of leadership held by 71 
girls in grammar school. The predictors consisted of a check list of 46 
traits to be filled out by the teacher for each girl, a ten-point scale con- 
cerning ‘‘the intensity of pleasant feeling’’ that each girl “‘subjectively 
associated with every other girlin her class,”’ and a rating by the teacher 
on a ten-point scale of ‘‘the amount of personality each girl possessed.” 
Thurstone’s simplified method was employed for analyzing the matrix of 
intercorrelations between these two sets of data. 

A correlation of .50 was found between leadership as defined above 
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and personality as rated by the teachers, and a coefficient of .33 between 
leadership and pleasingness of personality as rated by the girls. 

Flemming concludes that there seemed to be four types of leadership 
ability in this group. Scores on the predictors for these selected traits 
yielded a correlation with the leadership criterion of .57 on the same 
sample. No cross application of the weights to a new sample was re- 
ported. 

Thurstone (57) has performed a study in which the scores of 18 
campus leaders on a battery of tests were compared with scores for a 
group of several hundred college non-leaders. One of the most con- 
sistent findings was that the campus leaders were less subject to visual 
illusions as determined by various tests. They gave indication of color 
dominance and superiority on a color-form sorting test, were less subject 
to brightness contrast, and showed a slower rate of alternation of am- 
biguous perspective in the Necker Cube. On the Rorschach, the leaders 
excelled in the total number of responses, were superior in the per- 
ceptual organization score, showed greater latency for color cards, and 
a smaller number of responses to the color cards in comparison to the 
black-and-white cards. 

A comparison of the performance of college leaders in extra-curricu- 
lar activities with the norms for the Bernreuter Personality Inventory 
showed significant differences favoring the former in dominance and the 
latter in introversion (49). Other findings were unreliable or incon- 
sistent. These same authors have reported an item analysis of this in- 
ventory for the responses of 36 women college leaders and 45 non-leaders 
(28). 


MILITARY LEADERSHIP 


A considerable amount of literature has accumulated during recent 
years concerned with non-empirical definitions of military leadership 
and speculations concerning the choice of leaders. More recently, mili- 
tary psychologists have devoted research efforts to certain phases of the 
problem of leadership. Typical information in this area is summarized 
below. 

General. A series of books and pamphlets have been published by 
military writers concerning the problems of military leadership (1, 2, 
13, 37, 38, 39, 48, 67, 70). Unfortunately, none of these tomes is based 
on empirically determined evidence and all of them reflect the personal 
opinions and speculations of the authors. Typical of this group of pub- 
lications is a book by Ageton (1) concerning naval leadership. The 
author lists and discusses the following characteristics of leadership 
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which he considers to be of significance: simplicity, self-control, tact, 
honor, adherence to duty, and loyalty. The following types of catch- 
words and phrases are used to illustrate what the author believes are the 
characteristics of good leaders: practice what you preach, be cheerful, 
be a seaman, know your stuff, avoid careless criticism, etc. 

A manual recently issued by the U. S. Bureau of Naval Personnel (67) 
presents a series of principles for younger officers to be used in selecting 
men for promotions and leadership positions. Certain of the principles 
appear to be based on widely accepted psychological postulates, but no 
evidence to support their use in this context is reported. 

The main item of value that can be derived from the speculative re- 
ports on leadership appears to be an interpretation of the various mili- 
tary regulations which pertain to the problems of leading in military 
situations. These interpretations, discussions, and speculations con- 
cerning problems of leadership and how to behave as a military leader 
may be of value in helping to demark an area of research in which prob- 
lems of leadership may be attacked. 

An obvious starting place for a treatment of military leadership is a 
discussion of the procedures employed for selecting officers and officer 
candidates in various countries. Several of these are described in the 
following paragraphs. 

Selection of officer candidates in the U. S. Army. Typical of the pro- 
cedures employed in the armed services of the United States during 
World War II were those used in the U. S. Army. In this branch of the 
service, there were two types of commissioning: (1) direct commission- 
ing from civilian life and (2) receiving a commission following attend- 
ance at an Officer Candidate School. Procedures for selecting U. S. 
aircrew officer candidates have been described in detail elsewhere and 
will be omitted here (for example, see the series of articles issued by the 
AAF Aviation Psychology Program, this Journal, 1943-1945). 

In the early days of the war, a number of men were commissioned 
directly from civilian life into the U. S. Army. In practically all cases, 
these commissions were for specialists duties, particularly general ad- 
ministration. One factor that played a role in such commissioning was 
previous military experience, either Reserve or Regular Army. For 
such commissioning, there was an age minimum which was 30 in the 
early days of the war, and, in addition, the individual was required to 
pass a physical examination. In most cases the men were required to be 
college graduates. Other than these requirements and that of being 
qualified for some type of specialized duty there were practically no re- 
quirements. A Selection and Review Board examined the available in- 
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formation and accepted or rejected candidates. Occasionally men were 
interviewed by this Board. Practically all of the individuals commis- 
sioned in this manner did not participate in combat operations. There 
was no systematic follow-up study of the on-the-job performance of 
men so commissioned. 

The other type of commission received in the United States Army 
consisted of attending an Officer Candidate School and, upon gradua- 
tion, receiving a commission as a Second Lieutenant. The necessary 
prerequisites for such commissioning were as follows. First, it was neces- 
sary for the individual to make application and to meet certain require- 
ments. Among these requirements were the meeting of a cut-off score on 
a general intelligence test, known as the Army General Classification 
Test, and the passing of a physical examination. Another necessary 
prerequisite for attending Officer Candiate School was that the indi- 
vidual receive the recommendation and approval of his immediate com- 
manding officer who presumably was familiar with the work done by 
him. In addition, each man making application for attendance at 
Officer Candidate School had undergone a certain amount of basic 
training in an enlisted status. The final step in the sequence of applica- 
tion for admission to OCS was the meeting of an Officer Interview Board 
who interviewed each man individually and reviewed the various avail- 
able information on his qualifications and finally approved or rejected 
his application. Many of the criteria employed by these boards were 
subjective in nature. If approved, the individual attended an Officer 
Candidate School and took various courses, differing with the branch 
of the service. If graduated, he received a commission as a Second 
Lieutenant, and, if eliminated, he was returned to duty as an enlisted 
man. 

Selection of officer candidates in the British Army. An early report on 
officer selection procedures in the British Army is available in Aviation 
Psychology Abstract No. 56 (72). More recently a report has been re- 
leased describing testing procedures for officer candidates in Great 
Britain at the close of the war (22). The basic element of the selection 
system was the War Office Selection Board (W.O.S.B.) which consisted 
of a President (Colonel), a Military Testing Officer (M.T.O.) (Major or 
Captain), and a commissioned psychologist and psychiatrist. Candi- 
dates were organized into groups of approximately eight for testing 
which lasted three days. The tests were of three kinds: pencil and paper, 
practical or field, and interviews. The printed tests included an educa- 
tional and occupational questionnaire and a personal and medical one; 
three standard, general intelligence tests including mathematics and 
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figure analogy items requiring about 20 minutes each; and certain per- 
sonality tests. 

There were three practical tests: individual situations, command 
group situations, and leaderless group situations. In the case of the first 
the candidate might be asked to prepare and deliver a short talk to 
fellow candidates on group morale. In the second type of test the 
candidates took turns organizing their groups to solve an immediate, 
practical problem. The last situation was a test in which the group was 
faced with a task such as escaping across an electrically charged fence 
without the M.T.O.’s having appointed a leader. 

The procedures for scoring these various practical tests were not de- 
scribed, but were presumably subjective in nature. The difficulties in 
standardizing and objectifying such field measures in order to achieve 
adequate reliability and validity are obvious although the attempt to 
use such techniques is noteworthy. 

A series of interviews were also employed including a psychiatric 
interview, an “officer quality’”’ interview carried out by the Board 
President, and a technical interview for qualification in special service 
branches performed by a specialist in the particular field. A final Board 
Conference was held to meet the candidate and discuss his potentialities 
as officer material. 

Data comparing ratings of the candidates during training with test 
scores were reported, indicating superiority of the men selected by these 
procedures over those who had entered service prior to the initiation of 
these selection techniques. Without additional information concerning 
the circumstances of the validation, particularly about the criterion of 
proficiency, it is not possible to evaluate the findings. 

Selection of officer candidates in the German Army. A detailed de- 
scription of German testing procedures for officer candidates as of 1941 
is presented in a volume issued by the Committee for National Morale 
(17). The selection techniques are reported to include a life history 
examination, expression analysis (facial, body, voice, appearance, and 
handwriting), mental capacity as measured by intelligence and interests 
tests, including projection and completion examinations, and action 
analysis. The latter consisted of two tasks, the first being the ‘‘command 
series’ in which the candidate was given a series of orders to execute; 
his behavior was observed and recorded by the testers. The second 
phase of action analysis was the leadership test in which a group of in- 
fantry soldiers were placed under the command of the candidate who 
supervised the execution of a pre-assigned task, such as the assembling 
of a prefabricated bridge. The behavior of the candidate in executing 
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this task was noted, and the men working for him were questioned. 
The similarity of these methods to those of the British is apparent. In 
addition, an apparatus test was employed involving differential reaction 
to a number of levers in response to different patterns of red, white, and 
blue lights and to lights with circles and squares which were successively 
lit. The candidate’s errors and speed of response were automatically 
recorded, and his behavior while taking the test was observed. A test 
of choice reaction to auditory stimuli with the Rupp falling rod ap- 
paratus was also employed. 

The leadership examination was conducted at an Armed Forces 
Testing Station by a board of examiners consisting of an Army Colonel, 
a medical officer, and three psychologists. Two full days were devoted 
to testing for the Army and two and one-half for the Air Forces. The 
general tests were given to groups of four and five at a time, and the 
interviews were conducted individually by the psychologists. Round- 
table discussions with the candidate were held and, during the one day 
interval interspersed between the two days of testing, the candidate 
was observed. Procedures for scoring the tests, for combining the scores 
statistically, and for follow-up studies of predictive efficiency apparent- 
ly were not reported by the German psychologists. 

Fitts (18) has recently reported the details of German selection pro- 
cedures, particularly those in the Air Force, based on interviews with 
the military psychologists involved. Several items pertinent to the 
present discussion are contained in his report. For example, he points 
out how political, practical, and scientific factors contributed to the 
breakdown of the German Aviation Psychology Program. Of consider- 
able significance in the present connection is the fact that Fitts was un- 
able to locate any validation data which appeared to meet ordinary 
scientific criteria. The German program is a beautiful example of the 
uselessness of elaborate testing techniques hand in glove with complete 
disregard of their necessary concomitants—standardization, objectifica- 
tion, and validation. 

Research proposals and studies. A number of proposals have been 
made and several researches undertaken dealing directly or indirectly 
with military leadership (see, for example, 6, 14, 15, 30, and 69.) Typi- 
cal of these suggestions and investigations are the following items. 


A group at Harvard (61) has proposed the use of a physical fitness 
test, the Harvard Step Test; an interview; and somatotype measures 
for selecting combat officers. No over-all comparison of combined scores 
with criteria of officer efficiency was presented. 

Murray and Stein (42) have made suggestions concerning the selec- 
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tion of combat officers. Their emphasis and philosophy appear to be 
clinical in nature, not unlike the approach of the German psychologists 
reported previously. The measures included an interview, Murray’s 
Thematic Apperception Test administered individually and to groups, 
a Construction Test similar to the British practical tests and the Ger- 
man leadership tests, and a complex perceptual-motor test taken under 
verbal stress. A group conference is held at the end of testing for a 
terminal evaluation based on test performance and clinica] judgments 
of leadership characteristics. Validation data were not reported for 
the use of this test battery. 

A description of a number of potential measures for selecting combat 
leaders has recently been presented (36). These include an interview 
stressing participation in outdoor activities and a background of self- 
confidence, several perceptual-motor tests, and a printed test of military 
adaptability consisting of a series of multiple choice items in the form of 
descriptions of situations based upon actual combat events. 


The results of a number of military studies pertaining to leadership 
are not available for release at the present time, but several in the 
Army Air Forces have pointed out a number of deficiencies in the re- 
liability and validity of the criteria involved (66, 68, 71, 73). The find- 
ings of these investigations indicate a lack of agreement between inde- 
pendent judges when rating on over-all performance, and recommenda- 
tions were made in these studies for the use of ratings of work samples 
of performance in specific situations which have been directly observed 
as criteria. The difficulties of finding measures that correlate highly 
with success in Officer Candidate Schools have been reported by other 
investigators (23, 24, 74). 

A few investigations of military leadership that have been released 
are treated in the following paragraphs. Several reports of studies have 
been issued by the Information and Education Division of Army Service 
Forces concerned with morale in relation to problems of leadership in 
the U. S. Army (65, 69). 

In the first study privates in two regiments of an infantry division 
took an attitude questionnaire during their first few weeks in the Army, 
and the results of the study were reviewed to see how the morale at- 
titudes of men who rated promotions compared with those of other 
men. The study showed that the privates who became non-commissioned 
officers some months later were likely to differ from their buddies in the 
following regards: better disciplined, more self-confident, more likely to 
feel that what they were doing in the army was worthwhile, and more 
favorable attitudes toward their commissioned and non-commissioned 
officers. The statistical significance of these findings was not reported 
nor were the items on which differences were low or negative presented. 
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The fact that men who get along with their officers tend to be promoted 
is not surprising. As another part of the study, line noncoms in the two 
regiments (assumed to be the best enlisted infantrymen) were compared 
with privates and pfc’s on such factors as education, AGCT scores, 
physical characteristics and mechanical aptitude. The noncoms were 
generally found to have more education, intelligence, and mechanical 
aptitude, and to be slightly taller and heavier than the privates and 
pfc’s. In all the above cases, however, the differences between noncoms 
and privates and pfc’s were small. It was evident that the most striking 
ditferences between the two groups were in their morale attitude. 

Another study (69) reports a check-list of company leadership prac- 
tices. This report is based on a study of practices among 34 Army 
Service Forces companies in the continental United States. The prob- 
lem was the relationship between company leadership practices and 
company morale. Twelve selected companies answered eighteen ques- 
tions concerning leadership practices in their own outfits. Six of the 
companies were all rated high in morale by their post or battalion com- 
mander, their company officers, and their enlisted men. The six other 
companies were all rated low in morale by corresponding judges. The 
companies rated highest in morale by these judges were favorably 
rated by their own men on nearly all company practices. Those rated 
lowest fared very poorly in this respect. The items on which two-thirds 
or more of the men expressed favorable opinions, in all six of the com- 
panies rated highest in morale, involved an expression of interest in the 
men by the officers. 

The Multiple-Choice Group Rorschach Test has been administered 
to 56 officers selected by unit commanders as being actually excellent 
officers (30). Their scores were compared with those of 257 officer 
candidate students enrolled in three classes. The scores for the excellent 
officers were approximately the same as those for the officer candidates. 
The difference was not statistically significant. Forty-five per cent of 
the excellent officers and 35% of the officer candidates had four “‘poor 
responses”’ which was set as the critical score for initial screening pur- 
poses by Harrower-Erickson. Results on a Health Inventory, a Psy- 
chasthenic Inventory, and a Level of Aspiration Inventory also revealed 
no significant differences between the two groups. 

Several studies dealing with military leadership were performed by 
the Applied Psychology Panel of the National Defense Research Com- 
mittee during the war (23, 24). In a preliminary study in Infantry and 
Field Artillery Officer Candidate Schools it was found that individual 
raters tended to be consistent from one week to another when members 


‘ of each platoon rated one another. Correlations between the platoon 


members’ ratings and platoon leaders’ ratings were also high, but ap- 
peared to be spurious since the member ratings were available to the 
platoon leader before his ratings were reported. 
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In follow-up studies data were analyzed dealing with the forecasting 
of success or failure by various predictors in these same schools. It was 
concluded that none of the items obtainable before or upon entrance to 
Officer Candidate School is highly significant of Jater performance; 
quality of academic work is significantly related to success or failure in 
O.C.S.; and platoon leaders’ ratings of leadership correlate highly with 
success as would be expected in this situation where a man is eliminated 
if his ratings by the platoon leader are consistently low. 

In another study, 176 men on combat duty were rated by their 
superior officers on a five-point scale in a situation that indicated that 
the rating officers were taking account of the actual performance and 
were not rating simply on general impressions. There was some spread 
in the ratings with 13% of the 176 officers receiving superior ratings, 
49% excellent, 23% very satisfactory, 10% satisfactory, and 5% un- 
satisfactory. These combat ratings were compared with final company 
ratings in O.C.S., with Army General Classification Test scores, and 
with age. The conclusions drawn from this study were as follows: 


1. Combat efficiency is not very closely related to ratings of leadership ob- 
tained in O.C.S. The correlation between the two sets of ratings was .15 with 
a standard error of .075. 

2. It is reasonably clear that above a certain desirable minimum, intelligence 
as measured by AGCT has little relevance to combat performance. 

3. At the present time the Infantry School is eliminating a large number of 
those who would probably be unsuccessful officers. In addition, there appeared 
to be a practically zero relationship between age and combat ratings in this 
study. 


One study that is of considerable interest in the present connection. 


has recently been undertaken by the Personnel Research Section, C las- 
sification and Replacement Branch, Adjutant General’s Office (6), in- 
volving the development of instruments for selecting officers for 
positions in the post-war Army. Certain of the instruments derived 
from this study are now being employed in selecting Regular Army 
personnel from available applicants. Five kinds of procedures were 
employed in this study. 


The first of these was the Officer Evaluation Repert (OER) which was de- 
veloped as an improvement over the procedure of averaging previous efficiency 
ratings. The OER consists of five sections, the first two of which were designed 
to improve its objectivity. The third section contains five items which appear 
to measure all three of the major named rating factors—leadership, sense of 
duty, and stability which were derived on the basis of a factor analysis of War 
Department and AAF efficiency forms. The last two sections of the OER con- 
cern recommendations for the Regular Army and for the Officer Reserve Corps 
and an over-all rating giving the officer’s standing among previously known 
officers of the same grade. 

The second instrument which was developed is the New Interview Board. 
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Its main function is to evaluate the officer’s ability to get along with people in 
line and staff functions. The interview form provides two work sheets upon 
which to record observations, reactions, and ratings on three areas called bear- 
ing or manner, voice and language, and personality traits. The manual and 
prepared work sheets attempt to emphasize the social nature of the interview 
and minimize the influence of prejudice by forbidding prior acquaintance of 
any one of the five-member board with the applicant, any prior knowledge of 
general experience or background, or any attempt to explore or evaluate any 
phases of these not brought out by discussion during the interview. 

The third primary measure was a Biographical Information Blank requiring 
the reporting of background information of a personal, educational, and occu- 
pational nature in combination with self-description of traits, attitudes, and 
habits. The blank is based upon the previous experience of the National Re- 
search Council Committee on Selection and Training of Aircraft Pilots, National 
Defense Research Committe, the Worker Analysis Section of the Occupational 
Research Program of USES, and the U. S. Navy Department. 

Two other instruments were also developed in the course of this project to 
check general learning aptitude and educational achievement: (1) an Officer 
Classification Test, a test of general intelligence, and (2) a General Survey 
Test, a measure of educational achievement. 


Approximately 15,000 officers participated in the field trials in 
about fifty Army installations. The criterion involved a nomination 
procedure as follows. Approximately 15,000 officers were brought to- 
gether in small groups of 15 to 30 who were well enough acquainted with 
each other’s work to evaluate it. Each officer was requested to prepare 
two lists of the officers in his group, those High and those Low in gen- 
eral all-around value to the Army, listing them from highest to lowest, 
next highest, next lowest, etc. Each officer was requested to designate 
the five officers of the group most closely medium or middle with 
respect to over-all value. Only officers who fell clearly into one of 
the three groups—High, Middle, or Low—in the sense of virtually 
perfect agreement with their fellow officers (never more than one dis- 
senting vote for high or low officers, and never more than two dissenting 
votes in the case of middle officers) were employed for further study. In 
addition any officer placed over one group away by the Commanding 
Offcer was eliminated. A final sample of 3,000 cases was employed 
which appeared to be selected with 1,000 in the Top group, 1,000 in the 
Middle group, and 1,000 in the Low group. 

The Biographical Information Blank and the Interview were com- 
bined with the Officer Evaluation Report by statistical weighting pro- 
cedures to secure a Combined Point Index. Pearsonian correlation 
coefiicients were computed by arrauging the criterion variable in the 
three categories, H, M, and L, and each predictor as a continuous 
variable. These coefficients were as follows: Officer Evaluation Report 
-60, New Interview .39, Biographical Information Blank .35, Com- 
bined Point Index .67, Previous Efficiency Report .45, and Traditional 
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Army Board .09. It should be noted that these coefficients may have 
been increased by selecting samples of 1,000 for the Top, Middle, and 
Low groups from the total sample of 15,000. The degree to which the 
Commanding Officers’ ratings were weighted in the Officer Evaluation 
Report was not stated, but it appears likely that this factor played an 
important role. Substantial agreement between ratings by the C.O. 
and by fellow officers was to be expected. Since the OER had the highest 
validity, and the other measures when combined with it increased its 
correlation with the criterion only .07, these questions suggest the 
necessity for a further examination of the nature of the criterion here 
employed. 

Neither the Officer Classification Test nor the General Survey Test 
was related to the High, Middle, or Low criterion classification of the 
men, a finding which might have been expected since college graduation 
had been a prerequisite for most men directly commissioned and a score 
of 110 on the Army General Classification Test had been required for 
admission to O.C.S. or cadet training. The General ‘Survey Test 
showed moderate agreement with educational level reached. A cut-off 
score on the Officer Classification Test was recommended as a first 
hurdle before the Combined Point Index is applied. With regard to the 
General Survey Test, it was stated that while no cutting score on this 
instrument was recommended, it was believed that this test might be a 
better requirement than a general educational prerequisite. 

The general conclusion drawn from this investigation was that the 
Combined Point Index will tend to select officers satisfactorily on the 
basis of past and present performance. 

A research project concerned with combat leadership has recently 
been reported from the AAF Aviation Psychology Program (15). While 
the findings of these several studies cannot be released at the present 
time, the techniques are worth mentioning briefly. The first technique 
involved the collection of anecdotes dealing with the types of behavior 
exhibited in combat by men designated as successful or unsuccessful 
leaders or by those who assumed this role in cases of emergency. These 
materials were systematized and categorized, and secondly a rating 
scale was derived which served as a check on the results of the previous 
method. The third technique involved the use of one of the more objec- 
tive of the available criteria of leadership, namely, promotions. Data 
from a number of selection devices were compared with this criterion. 


DISCUSSION AND CONCLUSIONS 


As an over-all generalization it appears that Viteles’ (59) statement 
in 1932 concerning research on the selection of executives still holds for 
the general area of leadership: the record of accomplishment is not a 
brilliant one. No single trait or group of characteristics has been iso- 
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lated which sets off the leader from the members of his group. Several 
writers (32, 40, 41) in summarizing the literature in specific areas have 
stressed the cultural and situational determination of leadership. They 
have also pointed out the existence of wide individual differences within 
a given group as well as between groups. 

Advances in methodology in this field are definitely not striking. 
Three techniques have generally been employed: nomination of mem- 
bers of the group for positions of leadership, survey of the characteristics 
of outstanding individuals involving the use of questionnaires or pub- 
lished biographical information, and the use of selection test techniques 
for identifying leaders. Progress has not been made in the development 
of criteria of leadership behavior, nor in the setting-up of an adequate 
working definition of the concept to guide research in the isolating of 
leadership traits. 

The situation does not appear to be a particularly happy one with 
regard to the deriving of general principles or of setting up a systematic 
theory of leadership from the available information. A few statements 
may be set forth, however, that appear to hold for the findings of a 
number of the investigations reviewed ; this list should be thought of as 
a series of hypotheses for further investigation. 


1. Leadership is specific to the particular situation under investigation. 
Who becomes the leader of a given group engaging in a particular activity and 
what the leadership characteristics are in the given case are a function of the 
specific situation including the measuring instruments employed. Related to 
this conclusion is the general finding of wide variations in the characteristics 
of individuals who become leaders in similar situations, and even greater 
divergence in leadership behavior in different situations. 

2. In practically every study reviewed leaders showed some superiority 
over the members of their group in at least one of a wide variety of abilities. 
The only common factor appeared to be that leaders in a particular field need 
and tend to possess superior general or technical competence or knowledge in 
that area. General intelligence does not seem to be the answer for, as one 
writer (32) has pointed out, public leaders have ranged all the way from dull 
normal to genius. 

3. Leaders tend to exhibit certain characteristics in common with the 
members of their group. Two of the more obvious of these characteristics are 
interests and social background. 

4. Certain past history or background items appear to characterize leaders 
in certain activities. There is a widespread but vague hint that certain poorly 
defined personality traits characterize individuals holding positions of responsi- 
bility. It is practically impossible to evaluate these suggestions without addi- 
tional research. 

5. A number of studies suggest superiority of leaders over those in their 
group in physique, age, education, and socio-economic background, but the 
need for further research in this connection is evident. 
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SUMMARY 


This article reviews the findings and techniques of a number of in- 
vestigations of leadership in industry and government, the professions, 
school, and military situations. Special emphasis is given to the proce- 
dures for selecting officer personnel in the armies of the United States, 
Great Britain, and Germany; research studies of military personnel are 
stressed. Findings that appear to be common to a number of investiga- 
tions are presented as hypotheses or possible principles of leadership 
behavior. 
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COMMENT ON “THE VALIDITY OF PERSONALITY 
QUESTIONNAIRES” 


CHRISTIAN PAUL HEINLEIN 
Florida State College for Women 


In the september 1946 issue of the Psychological Bulletin,* Dr. Albert 
Ellis reviews a series of studies involving the use of personality in- 
ventories of different kinds. After considering the arguments for and 
against the various practices of validating personality questionnaires, he 
proceeds to evaluate the validity of selected research results by means 
of the following verbal categories assigned to arbitrary segments of the 
familiar Pearsonian scale of correlation coefficients: 


Negative validity (r’s of from .00 to .19) 

Mainly negative validity (r’s of from .20 to .39) 
Questionably positive validity (r’s of from .40 to .69) 
Mainly positive validity (r’s of from .70 to .79) 

. Positive validity (r's of from .80 to 1.00) 


nm Whe 


In arriving at this five-fold verbal classification of validity, Dr. 
Ellis remarks: 


“Since holding personality test evaluations to terms of D or E rather than 
r would probably be unfair (italics mine) at the present stage of their develop- 
ment, we shall, in this review, usually evaluate the reported coefficients of 
correlation in terms of the conventional estimations given them in the con- 
sideration of psychological and educational tests.” 


What, may I ask, could possibly be more “‘unfair’’ than to delude 
those readers who are unfamiliar with the mathematical derivation of r 
into believing that the phrases ‘‘mainly positive validity” and ‘‘positive 
validity”’ impart operational significance to the results of personality 
questionnaires? The five verbal categories which Dr. Ellis has coined are 
of the nature of semantic blabs, without operational referents. Arbi- 
trary ranges of correlation coefficients, which determine the limits of 
application of the five verbal classifications, can hardly be regarded as 
operational referents. The simple exchange of a verbal sign for a numeri- 
cal sign is not the discovery of an operational referent for the concept of 
validity. 

In his discussion of correlational techniques utilized for the purpose 


* Ellis, Albert. The validity of personality questionnaires. Psychol. Bull., 1946, 43, 
385-440. 
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of validating personality questionnaires, it is obvious that Dr. Ellis 
cognizes r as a trigonometric function; otherwise it is doubtful whether 
he would have mentioned the need of converting r into E; namely, 
1—(1—r’)*, To strengthen this view, we have his expressed conviction 
that ‘“‘many correlations reported on personality-test validity experi- 
ments should not be shown in terms of 7 at all.’’ If one were in a position 
to demonstrate that an r of a given size is an unequivocal index of func- 
tional dependence between two arrays of values normally distributed, 
and if, further, one were in a position to predicate empirically by virtue 
of unambiguous knowledge of the causative conditions which operate 
to provide a specified range of effects in response, then one would need 
the assurance of an r of at least .86 in order to bring about the reduction 
of the probable error of predication to one-half the probable error of 
guessing. In spite of this fact, Dr. Ellis selects an r of .80 as the lower 
limit of his highest category of validity. Conversion of the five succes- 
sive verbal categories of validity into ranges of E provides the following 
highly questionable series: 

1. Negative validity, E’s from .00 to .01 

2. Mainly negative validity, E’s from .02 to .07 

3. Questionably positive validity, E’s from .08 to .27 

4. Mainly positive validity, E's from .28 to .38 

5. Positive validity, E’s from .40 to 1.00 


It can readily be seen from this type of conversion—a conversion 
which Dr. Ellis favors but, paradoxically enough, regards as ‘‘unfair’’— 
that the fifth category covers an extension of value greater than that 
of the first four categories. If the above series seems questionable for 
the purpose of establishing categories of validity, then let it be remem- 
bered that the selection of the r ranges from which the E ranges have 
been derived is equally questionable. 

Unfortunately in the history of psychometrics, the coefficients r and 
E as indices of numerical communality have been grossly misunderstood 
and widely abused. The method of Pearsonian rectilinear correlation, 
which ignores the function of a time-axis and utilizes fixed individual 
deviations in a static matrix of paired numerical values, has been 
falsely identified with the method of concomitant variation. In the 
latter method, a controlled, isolated unit event A is varied by an ob- 
servable, measurable amount to ascertain its determining effect upon 
an observable, macroscopically constant unit event B. Only by a fantas- 
tic stretch of imagination can one attribute cause-and-effect relation- 
ship or a determining, predictive function to an r as such, irrespective of 
the size of the r. By a convenient act of extrapolation, one may choose to 
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ascribe a certain level of validity to an r of a given size. Such practice 
may be one of expediency or wishful thinking, but it certainly is not one 
of sound, scientific procedure. An adequate proof of a pragmatic kind 
is sorely needed to demonstrate the efficacy of r as an index of validity 

£ for a specific type of personality questionnaire or test situation. The 

i arbitrary verbal classifications introduced by Dr. Ellis do not constitute 
this kind of required proof. At most, they merely add to the extensive 
confusion that exists in the literature on test-validation. 
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DISCUSSION OF HEINLEIN’S COMMENT ON “THE VALIDITY 
OF PERSONALITY QUESTIONNAIRES” 


ALBERT ELLIS 
New York City 


Heinlein’s comment on my review of The Validity of Personality 
Questionnaires contains several points with which I concur and several 
with which I must take issue. I shall consider it paragraphically. 

In his first paragraph, Heinlein states that I proceed ‘‘to evaluate the 
validity of selected research results by means of . . . categories assigned 
to arbitrary segments of the familiar Pearsonian scale of correlation 
coefficients.”” There are at least three implications here that may be 
misleading : 

1. That I utilized selected research results; 

2. That I evaluated these results only in terms of correlation coefficients; 


3. That the verbal categories I assigned in evaluation of correlation coeffi- 
cients were quite arbitrary. 


Actually, the facts are these: 


i. I reported all research results that could be found in an extensive review 
of the literature. 

2. I evaluated the correlations of only those studies reporting findings in 
terms of r. Many studies reported critical ratios or other clear-cut statistical 
measures which could be accepted without any special evaluations. 

3. The verbal categories I applied to correlation coefficients were not arbi- 
trarily chosen. They have been used as a rule of thumb in the psychological 
literature for many years (e.g., Garrett, 4, p. 342), are endorsed by many 
authorities, and have been widely applied in psychology classes. 


A valid criticism which Heinlein fails to make of my verbal cate- 
gories is my inadvertently misnaming the first two of them. Strictly 
speaking, there is no such thing as “negative validity.’”’ What I call 
“negative validity’ in my article should really be termed “negligible 
validity,”’ and what I call ‘‘mainly negative validity,”” should be termed 
‘low validity.” 

In the second paragraph of his comment, Heinlein quotes me to the 
effect that holding personality test evaluation to terms of E rather than 
r would be unfair at the present stage of their development. Unfortu- 
nately, he italicizes unfair when the correct emphasis should be on the 
present stage of their development. Throughout his comment he falsely 
emphasizes unfair—henceforth wholly taken from its context—to make 
it appear as if I am contradicting myself. For it certainly would be a 
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contradiction were I to say that using E is an unfair statistical device— 
and then also to say that I favor its use. What my phrasing, correctly 
emphasized, means of course is that the use of E is perfectly fair, 
statistically and strictly speaking, but that because personality ques- 
tionnaires are still (I trust) in their infancy or adolescence, taking r’s 
obtained in validity experiments and reformulating them in terms of E 
would be unduly rigorous at the present time. Consequently—in order to 
give present-day personality inventories every possible benefit of the 
doubt—I consistently evaluated the reported r’s for validity in a con- 
ventional manner—and, as Heinlein points out, in a generous manner 
indeed. Whether I should be that generous twenty-five or fifty years 
from now is highly dubious. 

Heinlein’s third paragraph is concerned with taking me to task for 
“‘deluding”’’ my readers into believing that my verbal categories impart 
operational significance to the results of personality questionnaires. | 
naturally tried todo no such thing. First of all, I was not concerned with 
“the results of personality questionnaires,” but with the results of ex- 
periments attempting to validate questionnaires—which is quite a dif- 
ferent thing. Secondly, I was evaluating r’s only because they had been 
reported by investigators, and not because I personally believe that 
validity experiments should be analyzed in terms of 7. I heartily agree 
with Heinlein that ‘‘the simple exchange of a verbal sign for a numerical 
sign is not the discovery of an operational referent for the concept of 
validity.’’ I must remind him, however, that since r is still being con- 
stantly used as a measure of test validity, and since evaluations of re- 
ported r’s are bound to vary among different observers, some convenient 
verbal labels must often be given to them by general reviewers. All I 
did in my review was to categorize them with those ‘‘semantic blabs’” 
which seem to be most commonly employed by discriminating psycho- 
logical writers and teachers. If this is not scientifically sound, I should 
be happy to have Heinlein suggest a better procedure. 

In paragraphs four and five, Heinlein seems to be scandalized by the 
results which would be obtained by converting r’s into E’s. While the 
facts of these paragraphs are substantially correct, his manner of relat- 
ing them raises several implications which deviate from the truth: 


’ 


1. Heinlein calls the E series ‘‘highly questionable,”’ so that some readers 
might infer that there is something statistically wrong with it. This is of course 
not the case. 

2. He emphasizes the fact that “the fifth category covers an extension of 
value greater than that of the first four categories.” Naturally it does. As 
Hull (7), Bingham (1), Guilford (5), Garrett (4), Conrad and Martin (3), 
Peters and Van Voorhis (9) and others have pointed out, this is the distinguish- 





bts eS, eee ORE 


gprs te | aor 

















HEINLEIN ON “VALIDITY OF QUESTIONNAIRES" 85 


ing feature of using EZ. Indeed, it is precisely this ultra-conservative interpreta- 
tion which conversion into E gives to the lower degrees of r which recommends 
its used by test-makers who want to be certain that their personality ques- 
tionnaires are adequate for individual diagnosis and prediction. 

3. Heinlein again takes my term unfair out of context and—should I say 
unfairly?——over-emphasizes it in a manner I never intended. 

4. The main implication of these two paragraphs seems to be that, while 
the verbal categories which I use in my article are too generous to those who 
have validated personality questionnaires in terms of r, the categories in terms of 
E (which I advocated for use at some future date) are too severe. Heinlein 
does not quite make clear, though, why the classification in terms of E is too 
severe, except to imply that it looks that way. 


In his final paragraph, Heinlein points out that it is fantastic to 
“attribute cause-and-effect relationship or a determining, predictive 
function to an 7 as such, irrespective of the size of the r.”” Here he is ap- 
parently attacking all attempts to validate personality tests in terms of 
r. Now I hold no brief for test validation in terms of r. Nor do several 
other recent writers. Jackson and Ferguson (8), for example, advocate 
the use of analysis of variance for the calculation of test reliability and 
validity. Guilford (6) has recommended factor analysis. Brogden (2), 
on the other hand, has recently published a paper in which he sub- 
stantially upholds the value of r as a direct measure of predictive effi- 
ciency, and contends that conversion into E is quite unnecessary. If 
Heinlein has still a different advocacy in this connection, I should be 
interested in hearing it. The main point is that in my article I merely 
noted that a good many experimenters—whether or not I, Heinlein, or 
anyone else likes the fact—did report test validations in terms of r. I was 
concerned, therefore, only with evaluating, or giving a convenient label 
to, these already reported r’s. I wholly agree with Heinlein that ‘“‘an 
adequate proof of a pragmatic kind is sorely needed to demonstrate the 
efficiency of r as an index of validity for a specific type of personality 
questionnaire or test situation."’ My verbal classifications were certainly 
not designed to constitute that kind of required proof. Nor were they in 
the least intended to settle, once and for all, the many ticklish problems 
of measuring questionnaire validity. I am rather surprised that Hein- 
lein should have ever thought this to be the case. 

In sum, Heinlein’s comment on my article on The Validity of Per- 
sonality Questionnaires does not really call into question the general 
content or conclusions of my study but appears to boil down to (1) a 
questioning of the efficacy of r as an index of test validity, and (2) an 
attack upon the specific verbal categories that I attached to r’s actually 
obtained by test researchers. On the first of these points, I am inclined 
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largely to agree with his viewpoint and to welcome its airing. On the 
second one, I can only repeat that some verbal classifications of r’s must 
often be made for review purposes; that the ones I employed were 
based on long-term psychological usage; and that if Heinlein has any 
better suggestions to make, I shall certainly welcome them. 
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BOOK REVIEWS 


MASSERMAN, JULES H. Principles of dynamic psychiatry. Philadelphia: 
W. B. Saunders, 1946. Pp. xix+322. 


In 1943, J. H. Masserman published, under the title Behavior and 
Neuroses with the University of Chicago Press, an account of a long 
series of experiments which he had been conducting on cats. The ex- 
periments were designed to illustrate and elucidate behavior described 
largely under psychoanalytical categories. Masserman’s new book, 
The Principles of Dynamic Psychiatry, extends the range of application 
of these experiments and undertakes a very ambitious program of 
formulation of principles and corollaries. 

Masserman’s system derives from the author’s beginnings as a 
psychoanalyst and from an exposure to the Gestalt tradition plus wide 
reading in the field of general psychological theory, as well as his highly 
original line of experimentation. The Principles of Dynamic Psychiatry 
is divided into two parts of which the first (six chapters) covers the de- 
velopment of behavior theory in academic psychology and psycho- 
analysis. The second part (eight chapters) develops the author’s 
system of biodynamics. He states his conception of the principles of 
behavior and finds illustrations from his experimental and clinical 
records. Masserman finds that there are four general principles. 


1. Principle I reads: “Behavior is actuated by the physiologic needs of the 
organism and is directed toward the satisfaction of those needs.” This is of 
course a very explicit adoption of the teleological point of view in its most naive 
and direct form. Masserman has no interest in investigating how a need can 
actuate behavior. He deals with end results and not means. This first teleo- 
logical principle is supplemented by three others. 

2. Principle II (experiential interpretation and adaptation) is: ‘‘Behavior 
is contingent upon and adaptive to the organism's interpretation of its total 
milieu, as based on its capacities and previous experiences.” 

3. Principle III (deviation and substitution) reads: “Behavior patterns 
become deviated and fragmented under stress and, when further frustrated, 
tend toward substitutive satisfactions.” 

4. The fourth principle (conflict) is: ‘‘When in a given milieu two or more 
motivations come into conflict in the sense that their accustomed consum- 
matory patterns become incompatible, kinetic tension (anxiety) mounts and 
behavior becomes hesitant, vacillating, erratic, and poorly adaptive (neurotic) 
or excessively substitutive, symbolic, and regressive (psychotic).”” (The state- 
ment of this fourth principle is a fair sample of the author’s style.) 


The four principles are each supplemented by numbers of corollaries. 
As may be inferred from the rather sketchy and loose statements of the 
principles given above, the corollaries consist of exceptions, relevant 
general remarks, or of elaborations of the principles. They are distinctly 
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not corollaries in the sense in which that word is used by mathematicians 
or logicians. 

In an appendix, twenty-one pages are devoted to a detailed account 
of an illustrative psychoanalysis of a neurotic personality. Three other 
appendices include a short article on psychoanalytic formulations of 
the psychoses, an article on illustrative motion picture films prepared by 
the author and his associates, and a reprinting of an article on propa- 
ganda written by Masserman in 1942. Thirty pages are devoted to 
bibliography and forty to an extended glossary of psychiatric terms. 

“Psychoanalytic theory today,’’ Masserman says, “‘. . . is far from 
an accepted body of dogma; on the contrary, a great deal of it is fluid, 
ambiguous, unintegrated, and exceedingly polemic”’ [page 93]. Masser- 
man’s book should serve to reduce this confusion and advance the 
science of behavior. The interest of psychologists in the book will prob- 
ably be directed less toward the principles and corollaries which do not 
bear very close scrutiny than toward the experiments with cats which 
are highly original in conception. Results of the cat experiments are 
interpreted as illustrations of principles and corollaries. 

These experiments in general involved a cage, a means of administer- 
ing punishment in the form of an air blast or shock and neutral stimuli 
and food rewards. They cover the simple acquisition of substitutive 
goals, fixation on part patterns, dominance, frustration, aggression, 
regression to earlier solutions, conflict-generating discriminations, ex- 
perimental neurosis, and the effects on conflict of removal from the 
situation, forced solutions of conflict relief by re-training, inter-animal 
influence on neurosis, spontaneous working through to resolution of 
conflict, and the effects of drugs, particularly alcohol, on the behavior of 
animals in conflict situations. This is a rich experimental offering. 

EpwIn R. GUTHRIE. 

University of Washington. 


Rocers, C. R., & WALLEN, J. L. Counseling with returned servicemen. 
New York: McGraw-Hill, 1946. Pp. vii+159. 


Although the words “servicemen” and ‘“‘veteran’’ are frequently 
used and the cases illustrated are drawn from among returning service- 
men, this book makes no attempt at a specific analysis of veterans’ 
problems. Instead, it attempts to place such problems in the general 
framework of broad personality maladjustments to which the technique 
of nondirective counseling may be applied. 

As a general introduction to the technique of nondirective counsel- 
ing, the book is excellent. It is simply and clearly written, and the ex- 
position of nondirective methods is orderly and reinforced by a wealth 
of carefully chosen quotations from actual counseling interviews. The 
last chapter contains practice exercises for would-be counselors, and 
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there is a selected bibliography of 23 titles in the appendix. Since the 
authors apparently envisage the practicing of nondirective counseling 
by readers whose instruction may be limited to the reading of this book 
(‘‘Where skilled supervisory criticism is unobtainable definite profit may 
be obtained by a group of counselors studying together and analyzing 
and criticizing each others’ interviews’’), one wishes that the treatment 
of nondirective counseling were not so completely positive and en- 
thusiastic, and that some mention had been made of negative instances 
or of the possible limitations of the nondirective technique in handling 
some types of problem. 
WILLIAM A. Hunt. 
Northwestern University. 


JOHNSON, WENDELL. People in quandaries. New York: Harper, 1946. 

Pp. xiv+532. 

In this book Dr. Johnson applies general semantics to the problems 
of personal and social adjustment. His thesis is (p. 45) “‘ . . . that science 
clearly understood, can be used from moment to moment in everyday 
life, and that it provides a sound basis for warmly human and efficient 
living.”” Maladjustment, in Dr. Johnson’s opinion, is essentially the 
result of Idealism which leads to Frustration and finally, to Demoraliza- 
tion. This IFD sequence, together with the myriad evaluations, labels, 
and abstractions which stem from it, leads to the conditions out of which 
many serious disorders of behavior develop. 

The author presents a convincing case for his viewpoint, and he 
presents it in a clear and entertaining style rarely found in books on this 
subject. He spices his pages with a large number of examples which are 
both pointed and sprightly. For example, when discussing extensional 
devices, the author quotes a wag who reputedly said that there are two 
kinds of people in the world: those who always divide the people of the 
world into two groups and those who don’t. 

Most semanticists seem prone occasionally to yank Aristotle down 
from his niche in the halls of time and give him a thorough drubbing. 
Also, they like to indulge a similar mass action against the philosophers 
of any period. In this, Dr. Johnson is no exception. However, he de- 
votes but a few pages to dispatching Aristotle and Aristotelian logic, 
and his potshots at philosophers are few and mild. In contrast, he treats 
the members of the medical profession with exaggerated respect, almost 
with awe. He might have presented some excellent examples of how the 
members of some medical groups created problems of a semantic nature 
by hanging labels as psychoneurotic or NP on many thousands of service- 
men. 

A minor fault in the book is found in the author’s occasional tendency 
to cast aside logical restraint when elaborating a point. On page 280 he 
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w* tes, ‘Einstein has never performed a laboratory experiment; that 
ma, » ell be a major part of the reason for his tremendous scientific 
achie. ements.” 

But such lapses are quite rare. The book represents a significant con- 
tribution in an area which has hitherto been inadequately treated. 
While it is really a book for everyone, it has a special appeal for those 
who deal professionally with problems of adjustment. Clinical psycholo- 
gists will find particular value in the chapter “In Other People’s 
Quandaries” and in the semantic exercises and research discussion at 
the end of the book. As a text or as reference reading, People in Quan- 
daries should fit in well with psychology courses which deal with be- 
havior problems. 


Irwin AuGust BERG. 
University of Illinois. 


Carp, BERNARD. A Study of the influence of certain personal factors on a 
speech judgment. New Rochelle, New York: The Little Print, 1945. 
Pp. viii+122. 

The field of speech and speech education has expanded rapidly in 
recent years. In the selection of teachers, radio announcers and speakers, 
stage and screen performers, telephone operators, etc., as well as in 
placement work in the schools, the need for improved methods of speech 
testing has become acute. 

This study attempts to determine whether audible speech can be 
tested and rated objectively. It also describes in detail the methods used 
in constructing, administering, scoring and weighting an original battery 
of speech tests called Speech Appraisal Forms. 

Speech tests were administered to 25 male college seniors and gradu- 
ate students whose test performances were rated on Speech Appraisal 
Forms by 24 specially trained judges. The resulting data were treated 
statistically by means of the Latin Square modification of the ‘Analysis 
of Variance” technique. The author recognizes the limitations of this 
method and recommends further experimentation and more precise pro- 
cedures. He concludes, however, that such factors as dress, appearance, 
poise, etc., ‘‘need not significantly affect a judge’s rating of audible 
speech.” 

This work should be of particular interest to those concerned with 
speech testing of either children or adults. It will be of value, also, to 
those interested in conducting further research ia this important field. 


St. CLatr A. SWITZER. 
Miami University. 


MorGAN, JOHN. J. B. How to keep a sound mind. (Rev. Ed. of Keeping a 
sound mind.) New York: Macmillan, 1946. Pp. vii+404. 


The point of view expressed in this book is identical with that of the 
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first (1934) edition, i.e., that mental health depends upon the develop- 
ment of correct mental habits. Ten of the 14 chapters are essentially the 
same as ones in the older book, although the titles are not always the 
same. The materials have been rearranged and improved, new sections 
have been added and others deleted. The chapters on crime, self- 
confidence, exaggeration of defects, and how to strive toward desirable 
goals have been replaced with chapters dealing with a wholesome pat- 
tern of living, developing social poise and security, and “how to be 
happily maladjusted.’’ Some of the materials in the omitted chapters 
appear at various places in the new book. 

The brief bibliography at the end of the earlier edition has been re- 
moved and a list of references included for each chapter. Considerably 
fewer than half of the titles, however, bear dates later than 1934. The 
number of study questions has been greatly reduced. There are no 
tables, graphs, or illustrations. 

The style is interesting, with many case studies and other valuable 
anecdotal material. There is, however, a lack of documentation and of 
references to experimental studies. Occasionally statements are made 
which are questionable and should be supported by data. For example 
(p. 230), ““The probabilities that a man will remain in an occupation 
depend (according to statistical studies) much more on the financial and 
domestic responsibilities which he must meet than upon his success in or 
fitness for his job.’’ No evidence is cited. 

The book, offered as a basal textbook for college classes in mental 
hygiene, attempts to ‘‘... [put] in understandable form the basic 
principles involved in the preservation of one’s own mental health.” 
This purpose is well achieved, although the book perhaps will be more 
useful as supplementary material than as a basic text in mental hygiene, 
especially with students who have had previous work in psychology. 

CLAUDE M. DILLINGER. 

Illinois State Normal University. 


STERN, EpiItH M. The attendant’s guide. New York: The Common- 
wealth Fund, 1945. Pp. xiv+104. 


Recent “‘revelations”’ of alleged mistreatment of psychopathic hospi- 
tal patients has naturally raised questions about, and brought charges 
against, attendants employed in such institutions. Stern’s little book is 
a timely instrument for improving the training of attendants. 

Divided into three parts, it treats respectively of procedures de- 
manded by patients’ general characteristics, particular kinds of patients, 
and of the attendant and his job. The first part has to do with hospital 
routine. Part Two classifies patients by dominant symptoms (from the 
view point of ward care), for example, patients ‘‘who do too much,” are 
“not fit to be seen,” ‘‘convalescents,” and other similar groups. The 
final part is a frank talk about the attendant’s responsibilities and his 
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present limited prestige status. The advantages of the occupation are 
listed as valuable experience prefatory to more advanced levels of train- 
ing, and being in on the ground floor of a coming skilled craft. 

Throughout the style is direct and non-technical. Chapters are 
replete with specific suggestions for meeting practical problems. Mrs. 
Stern’s book will make a definitely favorable contribution to the goal it 
professes, namely the improvement of attendants’ skill. 

STANLEY S. MARZOLF. 
Illinois State Normal University. 


HUNTINGTON, ELLSwortTH. Mainsprings of civilization. New York: 
John Wiley, 1945. Pp. xii+660. 


Professor Huntington’s general thesis concerning weather and 
climate in relation to human affairs is widely known through his many 
publications. This book attempts an integration of much of his work to 
date. 

The mainsprings are: A “basic evolutionary force,’’ the innate 
qualities of the human organism, the physical environment, and the 
social environment or culiure. Although discussed more specifically in 
the author’s earlier writings, these basic concepts in the present context 
are not concisely expressed. For example, the “‘great evolutionary force 
which permeates all nature” is implicitly accepted as a kind of push from 
the rear operating in the history of life. Although told that “for thou- 
sands of years civilization has been advancing along certain definite 
lines,’’ the reader is left to infer from context that these ‘definite lines’’ 
probably have to do with the development of larger and larger single 
units of social organization and with the ‘“‘betterment”’ of living gen- 
erally. Again, the innate qualities of the organism are referred to in 
very genera] terms, sometimes almost as a primary good, insuring 
human progress. 

The writing is enthusiastic and vigorous and will undoubtedly appeal 
to those who seek to establish correlations among the several disciplines 
addressed to the study of human behavior. The book will be read with 
some skepticism by those who are well acquainted with the “‘ifs,’’ “‘ands” 
and ‘‘buts’’ pertaining to the facts established within any one discipline. 

Professor Huntington disavows the theory of innate race differences; 
he states that all peoples or stocks contain qualities which will enable 
them to adapt and to develop culturally, but that these qualities develop 
or fail to develop depending upon the presence or absence of appropriate 
environmental conditions. An ingenious comparison of Newfoundland 
and Icelandic cultures illustrates this interplay of organic quality and 
environing circumstances. 

Social psychologists will be interested in the treatment of character 
and inheritance, the place of “‘kiths’”’ (“a group of people relatively 
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homogeneous in language and culture and freely intermarrying with one 
another’) in history, and the significance of selective migration and 
environment in the development of kiths. Likewise, the consideration of 
weather in relation to mental activity and behavior, diet and national 
character, and cycies in human activity deserve attention. 

Psychologists will find Huntington’s criterion of mental activity 
somewhat limited: ‘‘A good measure of intellectual activity on a large 
scale is the circulation of books by libraries, especially ordinary city 
libraries. People read serious books more frequently when their minds 
are active than when they are inert’ (p. 344). Armed with library 
circulation figures from a number of American cities, the author demon- 
strates a seasonal and a weather linkage with reading and advances a 
theory of heightened mental activity in relation to increased atmos- 
pheric ozone. 

In only a few instances, chiefly citations from literature, are correla- 
tion coefficients presented. At one point (p. 230), the author refers to r’ 
as giving the ‘‘per cent of the resemblance between two variables due to 
a common cause’’; it might be more accurate to speak of ‘‘common 
elements.’’ This apparent lack of distinction between concomitance and 
causation appears at a number of points throughout the book. Causal 
relationships are freely deduced or inferred from observed associations. 
In practically no instance is the degree of the association established 
in terms of a correlation coefficient, or are the observed associations sub- 
mitted to a test of significance. 

The author accepts the concepts of multiple causation and of 
multiple correlation, but he experiences great difficulty in applying 
them to his data by verbal, descriptive methods. There is an almost 
complete absence of the concept of reliability of a statistic. When per- 
centages are compared, differences which fit a “‘trend,’’ no matter how 
slight, are accepted as “‘real.’’ Occasionally, the author will advance 
rational explanations for slight reversals in a trend. 

The chief problem which psychologists will have in evaluating this 
work is that of the quite different framework within which the author 
operates. His interest in human behavior is oriented toward man in the 
mass rather than toward man, the individual. He concentrates on trend 
rather than on variation. Huntington’s procedures probably do reveal 
facts which analysis, by segmenting data, tends to overlook. Roughly 
analogous situations occur in the relation of range of measurement to 
magnitude of correlation, and in the use of the correlation coefficient 
for actuarial prediction as contrasted with prediction of the single case. 

But in spite of the author’s apparent unfamiliarity with psychologi- 
cal methods and findings, this study has considerable value for psycholo- 
gists and their work. It reveals a body of material of undoubted 
significance for human life which can greatly complicate the psycholo- 
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gist’s effort to account for variance in, and to predict the course of be- 
havior. It affords a mass of ideas which could well command the re- 
search energies of a corps of social psychologists for a lifetime, trans- 
lating the ideas into hypotheses capable of experimental test. 
DALE B. HARRIs. 
University of Minnesota. 


SMITH, BrucE LANNEs, LASSWELL, HAROLD D., & CAsEy, RALPH D. 
Propaganda, communication, and public epinion; a comprehensive ref- 
erence guide. Princeton: Princeton Univ. Press, 1946. Pp. v+435. 


This book represents a continuation of Propaganda and promotional 
activities ; an annotated bibliography, also written by the present authors, 
and published in 1935. It lists titles of and briefly comments on nearly 
3,000 selected books, periodicals, and articles, most of which appeared 
between mid-1934 and March, 1943. In addition to this annotated 
bibliography, which fills approximately two-thirds of the book, there 
are certain other features of general interest. 

Four chapters at the beginning cover important background mate- 
rial: channels of communication, political communication specialists, 
description of contents of communications, and description of effects of 
communications. In his section on communication channels, Casey 
briefly reviews the historical development of American communications 
as influenced by the rise of democracy, industrialization, and urbaniza- 
tion. The economic aspects of communication are discussed, with 
special reference to costs of and relationships between radio broadcast- 
ing and newspaper publishing. 

Smith follows with a discussion of the major political communica- 
tion experts of our times. Information is given in tabular form concern- 
ing the heads and propaganda ministers during World War II of some 
of the larger nations. Their occupational origins are noted and data are 
presented concerning their fathers, and their own careers. Further 
tables give such facts about them as estimated incomes of fathers, 
childhood exposure to authoritative symbols of society, formal educa- 
tion, socio-economic status during first decade of employment, and 
ages. 

In the third and fourth introductory essays Lasswell attacks the 
problems of describing the contents and determining the effects of com- 
munications. Analysis of contents can be done by a listing and com- 
parison of themes, by counting of socially optimistic and pessimistic 
expressions in speeches, and noting values expressed in motion picture 
characterizations. It is desirable to base content analysis on insight into 
subject response. Effects may be judged by votes in free elections, ac- 
tions by public officials, by people in direct contact with the movements 
involved, etc. 
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The bibliography itself is arranged in sections covering (1) strategy 
and technique, (2) promoting groups, (3) response to be elicited, (4) 
symbols utilizec, (5) channels, (6) measurement, and (7) control and 
censorship. Entries are arranged alphabetically by titles within sections. 
One hundred and fifty outstanding titles are starred in the listings and 
commented on relatively more extensively. This sectioning and 
alphabetization is supplemented by a comprehensive combined subject 
and author index. 

As in the case of many books by several authors, the individual sec- 
tions are not too well integrated, although in themselves interesting and 
informative. Annotations do not seem to follow any particularly sys- 
tematic pattern, covering in various combinations the actual contents, 
their significance, their value, and the backgrounds of the authors. This 
approach sometimes leads to weak notes, but on the whole the annota- 
tion is quite adequate for most purposes. The 42-page author and sub- 
ject index should add greatly to the usefulness of the bibliography. 

This book is a reference work of great obvious value, indispensable 
to any adequate library. 

R. B. AMMONs. 

University of Denver. 


DEWEY, JOHN. Problems of men. New York: Philosophical Library, 
1946. Pp. iv+424. 


Except for a prefatory note and an introduction of 18 pages, this 
book consists wholly of a reprinting of articles published during the 
past 12 years in 14 different journals and magazines, together with one 
paper originally published, as stated in the prefatory note, “half a 
century ago”’ (exact time and place not indicated in the book, but ac- 
tually in Decennial Publications of the University of Chicago, 1903). 

A reviewer in The New York Times (June 9, 1946) has called this “‘a 
mighty book.’’ This may be true in the sense that its author is a mighty 
man of influence, through more than half a century, on psychology, 
education, and philosophy. His influence was felt earliest on psychology 
(functionalism) through his 1886 book, Psychology, and through his 
1896 article, ‘‘The Reflex Arc Concept in Psychology.”’ Since the turn 
of the century his influence has been greatest on educational theory and 
practice (“‘progressive education”’) and on philosophy (instrumentalism). 
Whatever book he writes will command the respect of thoughtful 
readers. What E. G. Boring wrote of Dewey in 1929 is equally true 
today,—‘‘While he has had in the last twenty-five years little effect 
upon psychology proper, he has led a very influential life in its effects 
upon intellectual America, expounding repeatedly the problems of 
human nature’ (A History of Experimental Psychology, 542). 

Except for the magic of the author’s name, however, this book is not 
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great as a book, since it was originally written, not as a book, but as in- 
dividual articles, each complete in itself, for a wide range of journals and 
magazines from The Journal of Philosophy to The Rotarian. Few readers 
except reviewers and devotees of Dewey’s philosophy in its entirety will 
have the persistence to read the whole book. The chapters on education 
will be of interest to the general reader, dealing as they do with a 
vigorous defense of democracy and the experimental method against 
all forms of external authority, whether these are based on classical 
tradition, medieval supernaturalism, or present-day demands for “‘a 
moratorium on science.’’ On the other hand, all the chapters of Part III, 
except one, are from The Journal of Philosophy; and they consist largely 
of technical philosophical arguments of interest mostly to other techni- 
cal philosophers. The uncut pages typically found in numbers of that 
journal in the average college library bear witness to this statement. 

One reviewer has spoken of Dewey’s “inimitably cluttered prose”’ 
(Time, June 26, 1946). While this is a true enough characterization in 
general, there are instances in this book of chapters, like the one on 
James Marsh and the first of the two on William James, so well written 
as to make applicable to Dewey himself what he says of Bertrand Rus- 
sell,—‘‘His lucidity and felicity of expression are ever the despair of 
lesser writers, and in. . . [these chapters] he has almost surpassed him- 
self” (p. 171). 

WESLEY R. WELLs. 
Syracuse University. 


RAND, W., SWEENY, M. E., & VINCENT, E. L. Growth and development of 
the young child. Philadelphia: W. B. Saunders, 1946. Pp. vii+481. 


The fourth edition of this elementary text of child growth and de- 
velopment appears six years after the third revision. The subject matter 
is almost completely reorganized, but much of the old content remains 
essentially unchanged. The new edition is arranged in topical sections— 
physical, intellectual, social and emotional, etc.—whereas, the earlier 
edition considered all of these topics in sections devoted to different 
chronological periods of development. While the more recent treatment 
provides a more logical and continuous approach to child development, 
it may make the book more cumbersome and inconvenient for parents 
who wish to read as their children develop. 

The sections of this new edition devoted to emotional, intellectual 
and social development have beén expanded and a chapter on current 
concepts of growth and development has been added. The chapter on 
current concepts reflects a significant trend in child care—a trend away 
from Watson’s dicta toward the contemporary philosophy of self- 
regulation of diet and development as proposed in recent publications 
by Gesell and Ilg, Ribble, the Aldriches, e¢ al. An example of this shift 
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in emphasis is the authors’ discussion of thumbsucking. In the third 
edition they state, ‘‘Mechanical restraints are sometimes helpful if used 
before the habit has become so important to the child, or if used with 
older children as a reminder when they themselves have decided to 
conquer the habit.”” In the recent revision one finds ““The older recom- 
mendations for the use of mechanical restraints, of rewards and punish- 
ments and other adult imposed devices are now considered not only 
poor practice, but in all probability risky to the future well-being of the 
child.”” References to the recent publications of Gesell and Ilg, and 
Ribble are copious. 

Although the new edition contains 417 references, as contrasted with 
only 221 in the third edition, one still finds many aspects of psycho- 
logical development ignored, sketchily treated, or inadequately sup- 
ported with available, published research data. In some cases, definite 
rules are laid down for child guidance without any reference to research 
findings. This, in many cases, appears to be an effort to extend a hand of 
aid and comfort to parents who want something definite—correct or in- 
correct. 

Since this revision retains much of the previously published material 
and includes a considerable amount of recent research, it should be ac- 
cepted and appreciated by those who have found the earlier editions 
useful. In the reviewer’s opinion this edition should be a helpful source 
book for a highly selected group of parents and a better-than-average 
textbook for an elementary course in child care and training. 

GEORGE G. THOMPSON. 

Syracuse University. 


BEAUMONT, HENRY. The psychology of personnel. New York: Longmans, 
Green, 1945. Pp. xiii +306. 

BEAUMONT, HENRY. Psychology applied to personnel. New York: 
Longmans, Green, 1946. Pp. viii + 167. 


The Psychology of Personnel is intended for employers and as a text 
in a general course for college students. The level of the text is such that 
a previous course in psychology would not be necessary as a prerequi- 
site. Psychology Applied to Personnel is a work-book to accompany the 
text. 

The Psychology of Personnel contains the following 11 chapters: I. 
Understanding Employees, II. Analyzing Jobs, III. Selecting Em- 
ployees, IV. Training Employees, V. Working Conditions, VI. The 
Workers’ Health, VII. Promoting Safety, VIII. Supervision, IX. Merit 
Rating, X. Providing Incentives, XI. Occupational Adjustment. Cita- 
tions from 59 industrial and other organizations, which are an impor- 
tant contribution of the book, give examples of personnel procedures. 
Thirteen organizations are cited with reference only to Training Em- 
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ployees. There is no author index, and in fact no author or almost none 
is cited by name in the entire book. There are no tables, no figures, and 
practically no psychological statistics. This type of presentation of the 
material which omits the names of investigators and statistics will preb- 
ably appeal to the average reader. 

As is somewhat implied by its title, the treatment of The Psychology 
of Personnel is intermediate between books on industrial psychology and 
personnel administration. The absence of statistical results and freedom 
from detailed reports of investigations makes it appear closer to the 
usual book on personnel administration than to the usual book on in- 
dustrial psychology. 

Beaumont has succeeded in writing a general book that is probably 
simple enough not to intimidate anyone. Almost all of the available 
conclusions of psychological applications to personnel are presented. 
On the other hand there is little perspective shown in pointing out major 
emphases. There are an unusually large number of topics included for a 
book of this size. Besides the topics expected from the chapter headings 
there are a good many surprising ones as for example a brief discussion 
of Planned Parenthood (p. 165). The simplicity of the book occasionally 
spills over into elaborating the obvious as “In all plants, offices, and 
stores where women are employed there is a need for separate sanitary 
facilities.” And with reference to evaluating experience, an entire page 
is devoted to pointing out that whether the experience of an auto 
mechanic in Detroit will be indicative of success in Tulsa depends on 
level of performance, pay level, hours of work, location of plant, and job 
environment (p. 60). 

In addition to presenting a great many examples of personnel prac- 
tices in American industry and business, considerable emphasis is 
placed on Army practices and on the problems of servicemen returning 
to civilian employment. 

The two fullest topics are those of training and selecting supervisors 
by tryout. 

Psychology Applied to Personnel is made up of two parts, Part I: 
Personnel Statistics, and Part II: Notes, References, Questions, and 
Applications. 

Part I: Personnel Statistics, gives the method and blank work sheets 
for calculating frequency distribution, measures of central tendency, 
range, standard deviation, group and individual comparisons, sig- 
nificance of traits, and product-moment correlation. A set of actual 
data on 94 cases from a company is given. These data are used in the 
illustrative problems and are to be used in assignments. These assign- 
ments are interesting. Uncharacteristic but nevertheless unfortunate is 
a slip in explaining correlations (p. 36). The explanation assumes that 
the relative importance of coefficients of correlation is of the same 
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magnitude as the size of the coefficients rather than the square of the 
coefficients. This error is not cleared up under The significance of cor- 
relation, when referring to a coefficient of —.09, ‘* .. . this relationship 
is not a close one so that there will be many exceptions to this general 
rule.’”” Wonderful understatement! 

Part Il of Psychology Applied to Personnel has chapter headings 
identical with those of The Psychology of Personnel. Twenty-five good 
true-false questions are presented for each of the 11 chapters of the text. 

A large number of up-to-date references are given with respect to 
each major topic of each chapter. These references are from a variety 
of sources, industrial magazines, personnel journals, advertising or semi- 
advertising publications by companies, government pub’ications, and 
the psychological literature. The references should be particularly use- 
ful to teachers. For the average employer or other student there are too 
many references and insufficient annotation to be of greatest usefulness. 
A deficiency with respect to a number of references to government 
sources is that the references are not satisfactorily definite, as for ex- 
ample, ““The Women’s Bureau of the United States Department of 
Labor has available several publications dealing with proper provisions 
for the employment of women in industry” (76). 

All in all these two books by Beaumont are a definite contribution in 
presenting personnel psychology at the intended elementary level. 

THoMAs W. HARRELL. 

University of Illinois. 


MAIER, NORMAN R. F. Psychology in industry. New York: Houghton 
Mifflin, 1946. Pp. xvi+463. 


Reports during the last decade describing studies of the effect of the 
social milieu in industry are reflected in this new Psychology in In- 
dustry. Although most of the topics covered are those found in the con- 
ventional industrial psychology text, the emphasis and amount of 
space allotted to each topic is considerably different. While this book is 
not a manual of advanced techniques, the discussion is by no means 
shallow. 

About one-quarter of the book considers general principles and 
causes of behavior, and the development of attitudes and morale. The 
problems of the worker on the job and his relationships to it, his fellow 
workers, and to management are discussed in detail. Maier feels that 
one must understand human behavior as it exists rather than to blame 
and punish. Man’s behavior is a product of multiple forces and these 
must be changed before behavior changes. ‘Causation implies that a 
given individual in a given situation must do as he does.’”” Management 
should seek causes rather than treat symptoms. Reference to the 
Western Electric studies shows the effect of social groupings, attitudes, 
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and morale upon performance. Since the background for the develop- 
ment of attitudes in the cases of management and labor is different, and 
since there is no “‘right,’’ mutual understanding of the other’s point of 
view is necessary. 

The section on individual differences, ratings, and the use of tests is 
elementary and general. Where other texts become expansive and 
specific, Maier deliberately curtails. Practically no specific tests are 
mentioned although the reader is referred to other sources. This section 
of the book is the weakest and most poorly organized. Even a non- 
technical review should emphasize the extreme importance of criteria. 
The consideration of job analysis (and its relationship to selection of 
adequate criteria) should precede material on selective and evaluative 
instruments. 

Fatigue, time-motion analysis, accidents, and the working environ- 
ment (illumination, atmosphere, noise) receive adequate attention. The 
author comments on the techniques employed by the industrial engi- 
neer, but his emphasis is on changes in the behavior of workers brought 
about by improved conditions. Training and motivation get standard 
treatment. Perhaps the topic of motivation should have been included 
in the earlier chapters concerned with causation in behavior. Labor 
turnover is interpreted as a symptom of dissatisfaction and manage- 
ment is advised to ferret out its causes. The final chapter reviews 
briefly, but perhaps too repetitiously, the material covered in the pre- 
ceding chapters with special emphasis upon its use by the supervisor, 
the industrial counselor, and higher management. 

A number of errors or questionable statements can be found, but 
probably not more than tend to creep into any text. For example, there 
is an implication that any test that will correlate .30 with a criterion is 
good, and while one grants that predictions will be better than chance, 
the implications of such a low validity coefficient should have been 
clarified. In fact, the author once implies that any test is better than no 
test. In another place one finds the statement, ‘Intelligence tests... 
are aptitude tests in the sense that education and experience have little 
or no effect on the score.’’ One statement says that jobs requiring 
dexterity and mechanical operations show no relationship with intel- 
ligence, but the following sentence indicates that lower levels of in- 
telligence are sometimes more satisfactory on these jobs. In reference 
to time-motion economy, the view that, ‘The right and left halves of a 
man’s body are mirror images of each other,” disregards the functional 
inequality of opposite sides of the body. 

The physical appearance of the book is good but appropriate il- 
lustrations might make for higher interest, particularly in view of its 
intended audience. The few references made are found at the bottoms 
of pages which allows easy reading. A bibliography by chapters is found 
in the appendix. 
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If used as a text, supplementary readings would be desirable. Cer- 
tainly space allotments do not actually reflect the amount of experi- 
mental information available in each area. However, the change in 
emphasis toward the social and individual aspects of work and the 
variety of applications suggested make refreshing and interesting read- 
ing for the student of industrial relations. 

LEsTER P. GUEST. 

The Pennsylvania State College. 


HARRIMAN, P. L. (Ed.) Twentieth century psychology. New York: 
Philosophical Library, 1946. Pp. xiii+712. 


This book does not fulfill the expectations implied by its attractive 
title. The reader who expects a survey of contemporary psychology will 
be disappointed. It is a collection of thirty-nine articles by an assort- 
ment of writers, most of them well-known to psychologists, and some of 
them heavyweight authorities. But twenty-five of the articles are re- 
prints from current periodicals, the majority easily accessible to profes- 
sional psychologists. Periodicals represented more than once include the 
Journal of Social Psychology and the Journal of Abnormal and Social 
Psychology, with four articles each, the Journal of General Psychology 
with three, and the Psychological Review with two and the larger part of 
another. Thus, many of the contributions will not be new to the profes- 
sion. 

According to the editor, the volume was prepared for the general 
reader, but it is uncertain what general reader he had in mind. The 
psychologically untrained reader will find the technical jargon too much 
for him. The student coming up in the field will find much better 
volumes for parallel reading in his course work, until he is ready to go 
directly to the periodical literature. About the only general reader I can 
think of for whom the volume might have been designed is the specialist 
in related fields of knowledge, e.g., philosophy, the social and natural 
sciences, etc. Such a reader is apt to get as much impression of what is 
wrong with psychology as of what psychologists are contributing of use 
and value to him. 

One thing which would appear to be wrong is that psychologists 
seem to write with a heaviness of touch that is well-nigh depressing. Let 
me cite only two of many possible examples. ‘‘How may differences in 
subgroup structure, group stratification, and potency of ego-centered 
and group-centered goals be utilized as criteria for predicting the social 
resultants of different group atmospheres?’’ ask Lewin, et al. (200). 
‘We may say that the disturbance of the tentatively established move- 
ment-stereotypy at this critical point disordered the manner in which 
the focal stimulus-patterns were encountered just before blocking oc- 
curred in the given blind, thereby interfering with expansion of the 
nuclear blind-conditioning process in the segment,’’ says Schnierla 
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(291-292). The general reader who wishes to know more about what is 
being discussed in passages like these may refer to previous publications 
of the writers. He may, in short, begin specializing in psychology and, 
in fact, in their brand of psychology. 

The book may also be censured for a carelessness of production that 
is hardly excusable on the ground of wartime difficulties. There are 
many typographical errors, occasional misplaced footnotes, and one 
complete failure of correspondence between illustrative figure and text 
(421-424). The text refers in capitals to A, B, C, D, E, F, Gand H, and 
to shaded areas HEG and GFH. The figure, if such it be, though it isn’t 
so labeled, has the lower case letters a, b,c, d, e, f and g scattered over 
the page, and no shaded area. Finally, there is a two-page index of one 
column to the page with a total of 70 entries for the more than 700 
pages of material. 

A third weakness is an emphasis, unbecoming of a field struggling to 
assume a proper position in the natural sciences, on the armchair 
method. One may have no objection to the use of intuition and insight 
whenever possible, but if Klein, who writes in defense of this method, 
wishes to see some prime examples of what it would contribute to the 
advancement of knowledge, he has only to read the closely following 
articles by Maslow and by Brunswick. The first refers to some per- 
centages of strength of motivation (40) that are plainly imaginative, 
and the second is abracadabra. ’ 

Lest this review suggest that there is nothing worthy of publication 
(or republication) in the book, I might mention that I enjoyed many of 
the original articles, including most of the review of his research by 
Schnierla, the article on conditioning by Harris, Brandt’s survey of his 
ocular photography (although a little more modesty might be in order 
in the descriptions of his apparatus), and, among the reprints, Har- 
rower-Erickson’s outline of Rorschach methods. Also, the article con- 
tributed by the editor is presented simply enough to indicate that he, at 
least, knew what general reader he had in mind. But these seem to bea 
meagre return for the heavy expenditure of time necessary to go through 
the whole volume. 

Gero. M. PETERSON. 
University of New Mexico 
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